Overview
Brought to you by YData
Dataset statistics
| Number of variables | 42 |
|---|---|
| Number of observations | 612910 |
| Missing cells | 11176607 |
| Missing cells (%) | 43.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.2 GiB |
| Average record size in memory | 2.1 KiB |
Variable types
| Text | 29 |
|---|---|
| Categorical | 5 |
| DateTime | 2 |
| Numeric | 3 |
| Boolean | 3 |
relationships.company1.data.type has constant value "company" | Constant |
relationships.company2.data.type has constant value "company" | Constant |
amount_normalized is highly overall correlated with financing_type_normalized and 1 other fields | High correlation |
financing_type_normalized is highly overall correlated with amount_normalized and 2 other fields | High correlation |
headcount is highly overall correlated with financing_type_normalized and 1 other fields | High correlation |
product_data.fuzzy_match is highly overall correlated with amount_normalized and 2 other fields | High correlation |
human_approved is highly imbalanced (84.6%) | Imbalance |
planning is highly imbalanced (98.9%) | Imbalance |
financing_type_tags is highly imbalanced (93.8%) | Imbalance |
product_data.fuzzy_match is highly imbalanced (79.6%) | Imbalance |
amount has 571623 (93.3%) missing values | Missing |
amount_normalized has 571627 (93.3%) missing values | Missing |
assets has 597248 (97.4%) missing values | Missing |
award has 593924 (96.9%) missing values | Missing |
contact has 524634 (85.6%) missing values | Missing |
event has 594343 (97.0%) missing values | Missing |
financing_type has 602468 (98.3%) missing values | Missing |
financing_type_normalized has 610462 (99.6%) missing values | Missing |
job_title has 541772 (88.4%) missing values | Missing |
location has 444747 (72.6%) missing values | Missing |
product has 392072 (64.0%) missing values | Missing |
product_data.full_text has 392134 (64.0%) missing values | Missing |
product_data.name has 597801 (97.5%) missing values | Missing |
product_data.release_type has 586093 (95.6%) missing values | Missing |
product_data.release_version has 612281 (99.9%) missing values | Missing |
product_data.fuzzy_match has 392134 (64.0%) missing values | Missing |
recognition has 587807 (95.9%) missing values | Missing |
vulnerability has 600409 (98.0%) missing values | Missing |
relationships.company1.data.id has 14768 (2.4%) missing values | Missing |
relationships.company1.data.type has 14768 (2.4%) missing values | Missing |
relationships.company2.data.id has 412313 (67.3%) missing values | Missing |
relationships.company2.data.type has 412313 (67.3%) missing values | Missing |
domain has 14768 (2.4%) missing values | Missing |
company_name has 14875 (2.4%) missing values | Missing |
ticker has 479215 (78.2%) missing values | Missing |
amount_normalized is highly skewed (γ1 = 151.4791593) | Skewed |
headcount is highly skewed (γ1 = 120.2437773) | Skewed |
Primary_ID has unique values | Unique |
confidence has 28568 (4.7%) zeros | Zeros |
headcount has 606682 (99.0%) zeros | Zeros |
Reproduction
| Analysis started | 2025-08-29 04:07:33.644681 |
|---|---|
| Analysis finished | 2025-08-29 04:10:24.288923 |
| Duration | 2 minutes and 50.64 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
Primary_ID
Text
Unique 
| Distinct | 612910 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 612910 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0020f127-3470-4cce-8989-1c79f45da217 |
|---|---|
| 2nd row | 009be1ff-6cfb-4e9f-a415-69baf71f47f3 |
| 3rd row | 01444124-7375-4f03-8879-eb8200b31504 |
| 4th row | 031a304c-29ca-415e-a815-e9c915896540 |
| 5th row | 037783ca-f3f7-4782-8a81-df3cae1ac936 |
| Value | Count | Frequency (%) |
| 08293af4-4dea-4eb8-a6a3-a925246c4d9a | 1 | < 0.1% |
| a3d9d83c-47d6-4c5a-b87c-ae3e52484f01 | 1 | < 0.1% |
| 0020f127-3470-4cce-8989-1c79f45da217 | 1 | < 0.1% |
| 009be1ff-6cfb-4e9f-a415-69baf71f47f3 | 1 | < 0.1% |
| 01444124-7375-4f03-8879-eb8200b31504 | 1 | < 0.1% |
| 031a304c-29ca-415e-a815-e9c915896540 | 1 | < 0.1% |
| 037783ca-f3f7-4782-8a81-df3cae1ac936 | 1 | < 0.1% |
| 03d14654-015f-4efa-b986-05a6b032e8ea | 1 | < 0.1% |
| 04143a02-d0a8-4079-97f1-35bc1497bfb9 | 1 | < 0.1% |
| 0493a8e0-6cb2-4a0c-9cff-9076252a963d | 1 | < 0.1% |
| Other values (612900) | 612900 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 2451640 | 11.1% |
| 4 | 1759767 | 8.0% |
| b | 1302685 | 5.9% |
| 8 | 1302655 | 5.9% |
| 9 | 1302417 | 5.9% |
| a | 1301566 | 5.9% |
| f | 1152251 | 5.2% |
| 1 | 1150254 | 5.2% |
| c | 1150153 | 5.2% |
| e | 1149878 | 5.2% |
| Other values (7) | 8041494 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 22064760 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| - | 2451640 | 11.1% |
| 4 | 1759767 | 8.0% |
| b | 1302685 | 5.9% |
| 8 | 1302655 | 5.9% |
| 9 | 1302417 | 5.9% |
| a | 1301566 | 5.9% |
| f | 1152251 | 5.2% |
| 1 | 1150254 | 5.2% |
| c | 1150153 | 5.2% |
| e | 1149878 | 5.2% |
| Other values (7) | 8041494 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 22064760 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| - | 2451640 | 11.1% |
| 4 | 1759767 | 8.0% |
| b | 1302685 | 5.9% |
| 8 | 1302655 | 5.9% |
| 9 | 1302417 | 5.9% |
| a | 1301566 | 5.9% |
| f | 1152251 | 5.2% |
| 1 | 1150254 | 5.2% |
| c | 1150153 | 5.2% |
| e | 1149878 | 5.2% |
| Other values (7) | 8041494 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 22064760 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| - | 2451640 | 11.1% |
| 4 | 1759767 | 8.0% |
| b | 1302685 | 5.9% |
| 8 | 1302655 | 5.9% |
| 9 | 1302417 | 5.9% |
| a | 1301566 | 5.9% |
| f | 1152251 | 5.2% |
| 1 | 1150254 | 5.2% |
| c | 1150153 | 5.2% |
| e | 1149878 | 5.2% |
| Other values (7) | 8041494 |
summary
Text
| Distinct | 603796 |
|---|---|
| Distinct (%) | 98.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 80.3 MiB |
Length
| Max length | 461 |
|---|---|
| Median length | 267 |
| Mean length | 65.628552 |
| Min length | 13 |
Unique
| Unique | 596279 ? |
|---|---|
| Unique (%) | 97.3% |
Sample
| 1st row | Unipart Manufacturing Group recognized as Transport and Storage sector winner. |
|---|---|
| 2nd row | OOS International received award two safety awards on Jan 1st '18. |
| 3rd row | NWN Corporation received award Global Winner for 2022 Microsoft Meetings, Calling & Devices for Microsoft Teams Partner of the Year Award on Jun 28th '22. |
| 4th row | Grape Solutions Plc. is developing Mobiliti app on Jan 1st '18. |
| 5th row | NWN Corporation launched two new kits, At-Home Essentials and Office Collaboration Room-as-a-Service on Apr 13th '22. |
| Value | Count | Frequency (%) |
| on | 253387 | 4.1% |
| with | 140619 | 2.3% |
| of | 118920 | 1.9% |
| inc | 111918 | 1.8% |
| launches | 108286 | 1.8% |
| as | 97730 | 1.6% |
| launched | 91532 | 1.5% |
| 1st | 89146 | 1.5% |
| partners | 85566 | 1.4% |
| the | 76885 | 1.3% |
| Other values (229409) | 4945994 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5507371 | 13.7% | |
| e | 3222999 | 8.0% |
| n | 2678470 | 6.7% |
| a | 2497542 | 6.2% |
| t | 2332005 | 5.8% |
| i | 2275691 | 5.7% |
| o | 2163941 | 5.4% |
| r | 2090734 | 5.2% |
| s | 1869253 | 4.6% |
| c | 1220028 | 3.0% |
| Other values (521) | 14366362 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 40224396 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 5507371 | 13.7% | |
| e | 3222999 | 8.0% |
| n | 2678470 | 6.7% |
| a | 2497542 | 6.2% |
| t | 2332005 | 5.8% |
| i | 2275691 | 5.7% |
| o | 2163941 | 5.4% |
| r | 2090734 | 5.2% |
| s | 1869253 | 4.6% |
| c | 1220028 | 3.0% |
| Other values (521) | 14366362 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 40224396 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 5507371 | 13.7% | |
| e | 3222999 | 8.0% |
| n | 2678470 | 6.7% |
| a | 2497542 | 6.2% |
| t | 2332005 | 5.8% |
| i | 2275691 | 5.7% |
| o | 2163941 | 5.4% |
| r | 2090734 | 5.2% |
| s | 1869253 | 4.6% |
| c | 1220028 | 3.0% |
| Other values (521) | 14366362 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 40224396 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 5507371 | 13.7% | |
| e | 3222999 | 8.0% |
| n | 2678470 | 6.7% |
| a | 2497542 | 6.2% |
| t | 2332005 | 5.8% |
| i | 2275691 | 5.7% |
| o | 2163941 | 5.4% |
| r | 2090734 | 5.2% |
| s | 1869253 | 4.6% |
| c | 1220028 | 3.0% |
| Other values (521) | 14366362 |
category
Categorical
| Distinct | 29 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.4 MiB |
| launches | |
|---|---|
| partners_with | |
| hires | |
| invests_into | |
| recognized_as | |
| Other values (24) |
Length
| Max length | 27 |
|---|---|
| Median length | 22 |
| Mean length | 10.881691 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | recognized_as |
|---|---|
| 2nd row | receives_award |
| 3rd row | receives_award |
| 4th row | is_developing |
| 5th row | launches |
Common Values
| Value | Count | Frequency (%) |
| launches | 199986 | |
| partners_with | 118857 | |
| hires | 65431 | 10.7% |
| invests_into | 25274 | 4.1% |
| recognized_as | 25103 | 4.1% |
| is_developing | 20850 | 3.4% |
| receives_award | 18985 | 3.1% |
| acquires | 18181 | 3.0% |
| invests_into_assets | 14405 | 2.4% |
| has_issues_with | 12501 | 2.0% |
| Other values (19) | 93337 |
Length
| Value | Count | Frequency (%) |
| launches | 199986 | |
| partners_with | 118857 | |
| hires | 65431 | 10.7% |
| invests_into | 25274 | 4.1% |
| recognized_as | 25103 | 4.1% |
| is_developing | 20850 | 3.4% |
| receives_award | 18985 | 3.1% |
| acquires | 18181 | 3.0% |
| invests_into_assets | 14405 | 2.4% |
| has_issues_with | 12501 | 2.0% |
| Other values (19) | 93337 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 828406 | |
| s | 794048 | |
| n | 571724 | |
| i | 517880 | 7.8% |
| a | 513942 | 7.7% |
| t | 454317 | 6.8% |
| r | 427219 | 6.4% |
| h | 420320 | 6.3% |
| _ | 384979 | 5.8% |
| c | 331733 | 5.0% |
| Other values (15) | 1424929 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6669497 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 828406 | |
| s | 794048 | |
| n | 571724 | |
| i | 517880 | 7.8% |
| a | 513942 | 7.7% |
| t | 454317 | 6.8% |
| r | 427219 | 6.4% |
| h | 420320 | 6.3% |
| _ | 384979 | 5.8% |
| c | 331733 | 5.0% |
| Other values (15) | 1424929 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6669497 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 828406 | |
| s | 794048 | |
| n | 571724 | |
| i | 517880 | 7.8% |
| a | 513942 | 7.7% |
| t | 454317 | 6.8% |
| r | 427219 | 6.4% |
| h | 420320 | 6.3% |
| _ | 384979 | 5.8% |
| c | 331733 | 5.0% |
| Other values (15) | 1424929 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6669497 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 828406 | |
| s | 794048 | |
| n | 571724 | |
| i | 517880 | 7.8% |
| a | 513942 | 7.7% |
| t | 454317 | 6.8% |
| r | 427219 | 6.4% |
| h | 420320 | 6.3% |
| _ | 384979 | 5.8% |
| c | 331733 | 5.0% |
| Other values (15) | 1424929 |
found_at
Date
| Distinct | 384036 |
|---|---|
| Distinct (%) | 62.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.4 MiB |
| Minimum | 2010-01-05 00:00:00+00:00 |
|---|---|
| Maximum | 2025-07-07 14:52:48+00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
confidence
Real number (ℝ)
Zeros 
| Distinct | 10001 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 8 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6043837 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 28568 |
| Zeros (%) | 4.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.012 |
| Q1 | 0.45 |
| median | 0.6489 |
| Q3 | 0.7997 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.3497 |
Descriptive statistics
| Standard deviation | 0.27093289 |
|---|---|
| Coefficient of variation (CV) | 0.44827962 |
| Kurtosis | -0.32263711 |
| Mean | 0.6043837 |
| Median Absolute Deviation (MAD) | 0.1699 |
| Skewness | -0.60381193 |
| Sum | 370427.98 |
| Variance | 0.073404633 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 38538 | 6.3% |
| 0 | 28568 | 4.7% |
| 0.5 | 1222 | 0.2% |
| 0.6726 | 1037 | 0.2% |
| 0.6084 | 979 | 0.2% |
| 0.7529 | 627 | 0.1% |
| 0.7282 | 608 | 0.1% |
| 0.6375 | 588 | 0.1% |
| 0.7347 | 564 | 0.1% |
| 0.5747 | 554 | 0.1% |
| Other values (9991) | 539617 |
| Value | Count | Frequency (%) |
| 0 | 28568 | |
| 0.0001 | 24 | < 0.1% |
| 0.0002 | 19 | < 0.1% |
| 0.0003 | 8 | < 0.1% |
| 0.0004 | 24 | < 0.1% |
| 0.0005 | 10 | < 0.1% |
| 0.0006 | 10 | < 0.1% |
| 0.0007 | 20 | < 0.1% |
| 0.0008 | 35 | < 0.1% |
| 0.0009 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 38538 | |
| 0.9999 | 52 | < 0.1% |
| 0.9998 | 41 | < 0.1% |
| 0.9997 | 55 | < 0.1% |
| 0.9996 | 30 | < 0.1% |
| 0.9995 | 48 | < 0.1% |
| 0.9994 | 33 | < 0.1% |
| 0.9993 | 44 | < 0.1% |
| 0.9992 | 27 | < 0.1% |
| 0.9991 | 64 | < 0.1% |
article_sentence
Text
| Distinct | 589468 |
|---|---|
| Distinct (%) | 96.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 178.4 MiB |
Length
| Max length | 895 |
|---|---|
| Median length | 554 |
| Mean length | 162.92406 |
| Min length | 14 |
Unique
| Unique | 568980 ? |
|---|---|
| Unique (%) | 92.8% |
Sample
| 1st row | In addition to being named the safest organisation in the UK, Unipart Logistics won the British Safety Council Chief Adjudicator Award for achieving the highest-scoring application of the 647 received from around the world, and was named Transport and Storage sector winner. |
|---|---|
| 2nd row | Since then OOS International has been an active member of the IADC and received two safety awards in 2018. |
| 3rd row | As a result, with nearly 400 nominees from over 100 countries, NWN Corporation is pleased to announce NWN Carousel was recognized as a Global Winner for 2022 Microsoft Meetings, Calling & Devices for Microsoft Teams Partner of the Year Award. |
| 4th row | MVM Mobiliti and Grape Solutions have been working together since 2018 to develop the Mobiliti app, becoming the most downloaded electric car charging app in Hungary, with more than 215,000 charging stations in 39 countries. |
| 5th row | NWN Carousel, the leading integrated cloud communications service provider, today announced two new kits, At-Home Essentials and Office Collaboration Room-as-a-Service, for organizations to manage the accelerating demands of the hybrid workplace with connectivity, security, devices and visual collaboration. |
| Value | Count | Frequency (%) |
| the | 709462 | 4.7% |
| and | 413318 | 2.7% |
| to | 398940 | 2.6% |
| of | 397305 | 2.6% |
| in | 340668 | 2.2% |
| a | 340511 | 2.2% |
| has | 224946 | 1.5% |
| with | 223187 | 1.5% |
| for | 195680 | 1.3% |
| its | 164231 | 1.1% |
| Other values (383319) | 11817085 |
Most occurring characters
| Value | Count | Frequency (%) |
| 14609051 | ||
| e | 8819245 | 8.8% |
| a | 6788917 | 6.8% |
| n | 6389938 | 6.4% |
| t | 6363283 | 6.4% |
| i | 6146753 | 6.2% |
| o | 5728229 | 5.7% |
| r | 5182646 | 5.2% |
| s | 4646458 | 4.7% |
| l | 3283642 | 3.3% |
| Other values (945) | 31899624 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 99857786 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 14609051 | ||
| e | 8819245 | 8.8% |
| a | 6788917 | 6.8% |
| n | 6389938 | 6.4% |
| t | 6363283 | 6.4% |
| i | 6146753 | 6.2% |
| o | 5728229 | 5.7% |
| r | 5182646 | 5.2% |
| s | 4646458 | 4.7% |
| l | 3283642 | 3.3% |
| Other values (945) | 31899624 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 99857786 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 14609051 | ||
| e | 8819245 | 8.8% |
| a | 6788917 | 6.8% |
| n | 6389938 | 6.4% |
| t | 6363283 | 6.4% |
| i | 6146753 | 6.2% |
| o | 5728229 | 5.7% |
| r | 5182646 | 5.2% |
| s | 4646458 | 4.7% |
| l | 3283642 | 3.3% |
| Other values (945) | 31899624 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 99857786 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 14609051 | ||
| e | 8819245 | 8.8% |
| a | 6788917 | 6.8% |
| n | 6389938 | 6.4% |
| t | 6363283 | 6.4% |
| i | 6146753 | 6.2% |
| o | 5728229 | 5.7% |
| r | 5182646 | 5.2% |
| s | 4646458 | 4.7% |
| l | 3283642 | 3.3% |
| Other values (945) | 31899624 |
human_approved
Boolean
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.3 MiB |
| False | |
|---|---|
| True | 13702 |
| Value | Count | Frequency (%) |
| False | 599208 | |
| True | 13702 | 2.2% |
planning
Boolean
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.3 MiB |
| False | |
|---|---|
| True | 575 |
| Value | Count | Frequency (%) |
| False | 612335 | |
| True | 575 | 0.1% |
amount
Text
Missing 
| Distinct | 11112 |
|---|---|
| Distinct (%) | 26.9% |
| Missing | 571623 |
| Missing (%) | 93.3% |
| Memory size | 24.9 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 31 |
| Mean length | 10.522392 |
| Min length | 2 |
Unique
| Unique | 7383 ? |
|---|---|
| Unique (%) | 17.9% |
Sample
| 1st row | $155,000 |
|---|---|
| 2nd row | $1m |
| 3rd row | 8.8 billion baht |
| 4th row | $2.5M |
| 5th row | $32,000 |
| Value | Count | Frequency (%) |
| million | 24157 | |
| billion | 4470 | 6.0% |
| 1 | 851 | 1.1% |
| crore | 717 | 1.0% |
| 100 | 674 | 0.9% |
| 10 | 668 | 0.9% |
| rs | 600 | 0.8% |
| usd | 595 | 0.8% |
| 5 | 502 | 0.7% |
| 2 | 493 | 0.7% |
| Other values (8460) | 40382 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 59004 | |
| i | 58303 | |
| $ | 34789 | 8.0% |
| 0 | 33798 | 7.8% |
| 32818 | 7.6% | |
| o | 30644 | 7.1% |
| n | 30182 | 6.9% |
| m | 27104 | 6.2% |
| 1 | 16411 | 3.8% |
| 5 | 14718 | 3.4% |
| Other values (77) | 96667 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 434438 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 59004 | |
| i | 58303 | |
| $ | 34789 | 8.0% |
| 0 | 33798 | 7.8% |
| 32818 | 7.6% | |
| o | 30644 | 7.1% |
| n | 30182 | 6.9% |
| m | 27104 | 6.2% |
| 1 | 16411 | 3.8% |
| 5 | 14718 | 3.4% |
| Other values (77) | 96667 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 434438 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 59004 | |
| i | 58303 | |
| $ | 34789 | 8.0% |
| 0 | 33798 | 7.8% |
| 32818 | 7.6% | |
| o | 30644 | 7.1% |
| n | 30182 | 6.9% |
| m | 27104 | 6.2% |
| 1 | 16411 | 3.8% |
| 5 | 14718 | 3.4% |
| Other values (77) | 96667 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 434438 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 59004 | |
| i | 58303 | |
| $ | 34789 | 8.0% |
| 0 | 33798 | 7.8% |
| 32818 | 7.6% | |
| o | 30644 | 7.1% |
| n | 30182 | 6.9% |
| m | 27104 | 6.2% |
| 1 | 16411 | 3.8% |
| 5 | 14718 | 3.4% |
| Other values (77) | 96667 |
amount_normalized
Real number (ℝ)
High correlation  Missing  Skewed 
| Distinct | 8246 |
|---|---|
| Distinct (%) | 20.0% |
| Missing | 571627 |
| Missing (%) | 93.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4061063 × 1010 |
| Minimum | -6 × 109 |
|---|---|
| Maximum | 7.5 × 1014 |
| Zeros | 359 |
| Zeros (%) | 0.1% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 9.4 MiB |
Quantile statistics
| Minimum | -6 × 109 |
|---|---|
| 5-th percentile | 72000 |
| Q1 | 4207000 |
| median | 35000000 |
| Q3 | 2.25 × 108 |
| 95-th percentile | 3.6 × 109 |
| Maximum | 7.5 × 1014 |
| Range | 7.50006 × 1014 |
| Interquartile range (IQR) | 2.20793 × 108 |
Descriptive statistics
| Standard deviation | 4.4385953 × 1012 |
|---|---|
| Coefficient of variation (CV) | 130.31288 |
| Kurtosis | 23643.49 |
| Mean | 3.4061063 × 1010 |
| Median Absolute Deviation (MAD) | 34700000 |
| Skewness | 151.47916 |
| Sum | 1.4061429 × 1015 |
| Variance | 1.9701128 × 1025 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100000000 | 734 | 0.1% |
| 10000000 | 595 | 0.1% |
| 50000000 | 576 | 0.1% |
| 1000000 | 553 | 0.1% |
| 20000000 | 548 | 0.1% |
| 1000000000 | 522 | 0.1% |
| 5000000 | 485 | 0.1% |
| 200000000 | 469 | 0.1% |
| 30000000 | 465 | 0.1% |
| 15000000 | 443 | 0.1% |
| Other values (8236) | 35893 | 5.9% |
| (Missing) | 571627 |
| Value | Count | Frequency (%) |
| -6000000000 | 1 | < 0.1% |
| 0 | 359 | |
| 1000 | 61 | < 0.1% |
| 2000 | 44 | < 0.1% |
| 3000 | 28 | < 0.1% |
| 4000 | 18 | < 0.1% |
| 5000 | 68 | < 0.1% |
| 6000 | 17 | < 0.1% |
| 7000 | 13 | < 0.1% |
| 8000 | 20 | < 0.1% |
| Value | Count | Frequency (%) |
| 7.5 × 1014 | 1 | |
| 5 × 1014 | 1 | |
| 1.5 × 1013 | 1 | |
| 1.13 × 1013 | 2 | |
| 9.2 × 1012 | 1 | |
| 9 × 1012 | 2 | |
| 6.66 × 1012 | 1 | |
| 4.2 × 1012 | 1 | |
| 3.8 × 1012 | 1 | |
| 3.3 × 1012 | 1 |
assets
Text
Missing 
| Distinct | 10145 |
|---|---|
| Distinct (%) | 64.8% |
| Missing | 597248 |
| Missing (%) | 97.4% |
| Memory size | 24.1 MiB |
Length
| Max length | 108 |
|---|---|
| Median length | 80 |
| Mean length | 22.466032 |
| Min length | 2 |
Unique
| Unique | 8608 ? |
|---|---|
| Unique (%) | 55.0% |
Sample
| 1st row | energy vehicle (NEV) production plant |
|---|---|
| 2nd row | EV production facilities |
| 3rd row | energy vehicle production facility |
| 4th row | research center |
| 5th row | data center |
| Value | Count | Frequency (%) |
| in | 2296 | 4.6% |
| stake | 2079 | 4.2% |
| facility | 1737 | 3.5% |
| plant | 1380 | 2.8% |
| and | 1308 | 2.6% |
| center | 860 | 1.7% |
| manufacturing | 700 | 1.4% |
| facilities | 655 | 1.3% |
| of | 551 | 1.1% |
| building | 519 | 1.0% |
| Other values (7931) | 37368 |
Most occurring characters
| Value | Count | Frequency (%) |
| 33787 | 9.6% | |
| e | 31513 | 9.0% |
| t | 27107 | 7.7% |
| i | 26908 | 7.6% |
| a | 26798 | 7.6% |
| n | 22782 | 6.5% |
| r | 21444 | 6.1% |
| s | 18486 | 5.3% |
| o | 17589 | 5.0% |
| l | 15163 | 4.3% |
| Other values (103) | 110286 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 351863 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 33787 | 9.6% | |
| e | 31513 | 9.0% |
| t | 27107 | 7.7% |
| i | 26908 | 7.6% |
| a | 26798 | 7.6% |
| n | 22782 | 6.5% |
| r | 21444 | 6.1% |
| s | 18486 | 5.3% |
| o | 17589 | 5.0% |
| l | 15163 | 4.3% |
| Other values (103) | 110286 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 351863 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 33787 | 9.6% | |
| e | 31513 | 9.0% |
| t | 27107 | 7.7% |
| i | 26908 | 7.6% |
| a | 26798 | 7.6% |
| n | 22782 | 6.5% |
| r | 21444 | 6.1% |
| s | 18486 | 5.3% |
| o | 17589 | 5.0% |
| l | 15163 | 4.3% |
| Other values (103) | 110286 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 351863 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 33787 | 9.6% | |
| e | 31513 | 9.0% |
| t | 27107 | 7.7% |
| i | 26908 | 7.6% |
| a | 26798 | 7.6% |
| n | 22782 | 6.5% |
| r | 21444 | 6.1% |
| s | 18486 | 5.3% |
| o | 17589 | 5.0% |
| l | 15163 | 4.3% |
| Other values (103) | 110286 |
assets_tags
Text
| Distinct | 156 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.7 MiB |
Length
| Max length | 60 |
|---|---|
| Median length | 0 |
| Mean length | 1.1314565 |
| Min length | 0 |
Unique
| Unique | 54 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row | office |
| Value | Count | Frequency (%) |
| education | 14183 | |
| research_and_development | 10530 | |
| it | 8492 | |
| production | 7265 | |
| hospitality | 4300 | 6.9% |
| retail | 3939 | 6.3% |
| transportation | 3610 | 5.8% |
| energy | 3226 | 5.2% |
| distribution | 2833 | 4.6% |
| office | 2230 | 3.6% |
| Other values (2) | 1465 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 83807 | |
| t | 70928 | |
| n | 58633 | 8.5% |
| i | 58283 | 8.4% |
| o | 55826 | 8.1% |
| a | 53548 | 7.7% |
| r | 47008 | 6.8% |
| d | 45341 | 6.5% |
| c | 34250 | 4.9% |
| p | 25705 | 3.7% |
| Other values (13) | 160152 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 693481 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 83807 | |
| t | 70928 | |
| n | 58633 | 8.5% |
| i | 58283 | 8.4% |
| o | 55826 | 8.1% |
| a | 53548 | 7.7% |
| r | 47008 | 6.8% |
| d | 45341 | 6.5% |
| c | 34250 | 4.9% |
| p | 25705 | 3.7% |
| Other values (13) | 160152 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 693481 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 83807 | |
| t | 70928 | |
| n | 58633 | 8.5% |
| i | 58283 | 8.4% |
| o | 55826 | 8.1% |
| a | 53548 | 7.7% |
| r | 47008 | 6.8% |
| d | 45341 | 6.5% |
| c | 34250 | 4.9% |
| p | 25705 | 3.7% |
| Other values (13) | 160152 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 693481 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 83807 | |
| t | 70928 | |
| n | 58633 | 8.5% |
| i | 58283 | 8.4% |
| o | 55826 | 8.1% |
| a | 53548 | 7.7% |
| r | 47008 | 6.8% |
| d | 45341 | 6.5% |
| c | 34250 | 4.9% |
| p | 25705 | 3.7% |
| Other values (13) | 160152 |
award
Text
Missing 
| Distinct | 16536 |
|---|---|
| Distinct (%) | 87.1% |
| Missing | 593924 |
| Missing (%) | 96.9% |
| Memory size | 24.7 MiB |
Length
| Max length | 373 |
|---|---|
| Median length | 170 |
| Mean length | 37.589329 |
| Min length | 4 |
Unique
| Unique | 15473 ? |
|---|---|
| Unique (%) | 81.5% |
Sample
| 1st row | two safety awards |
|---|---|
| 2nd row | Global Winner for 2022 Microsoft Meetings, Calling & Devices for Microsoft Teams Partner of the Year Award |
| 3rd row | National Science and Technology Progress Award |
| 4th row | SMB Partner of the Year |
| 5th row | Faculty of Medicine and Health Sciences |
| Value | Count | Frequency (%) |
| award | 10547 | 9.7% |
| the | 5409 | 5.0% |
| of | 4618 | 4.2% |
| for | 3560 | 3.3% |
| in | 3052 | 2.8% |
| awards | 3020 | 2.8% |
| year | 2847 | 2.6% |
| best | 2498 | 2.3% |
| excellence | 1761 | 1.6% |
| and | 1687 | 1.6% |
| Other values (10526) | 69666 |
Most occurring characters
| Value | Count | Frequency (%) |
| 89654 | 12.6% | |
| e | 61313 | 8.6% |
| a | 53786 | 7.5% |
| r | 50056 | 7.0% |
| n | 39775 | 5.6% |
| t | 39328 | 5.5% |
| o | 38788 | 5.4% |
| i | 37562 | 5.3% |
| d | 25170 | 3.5% |
| s | 23618 | 3.3% |
| Other values (131) | 254621 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 713671 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 89654 | 12.6% | |
| e | 61313 | 8.6% |
| a | 53786 | 7.5% |
| r | 50056 | 7.0% |
| n | 39775 | 5.6% |
| t | 39328 | 5.5% |
| o | 38788 | 5.4% |
| i | 37562 | 5.3% |
| d | 25170 | 3.5% |
| s | 23618 | 3.3% |
| Other values (131) | 254621 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 713671 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 89654 | 12.6% | |
| e | 61313 | 8.6% |
| a | 53786 | 7.5% |
| r | 50056 | 7.0% |
| n | 39775 | 5.6% |
| t | 39328 | 5.5% |
| o | 38788 | 5.4% |
| i | 37562 | 5.3% |
| d | 25170 | 3.5% |
| s | 23618 | 3.3% |
| Other values (131) | 254621 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 713671 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 89654 | 12.6% | |
| e | 61313 | 8.6% |
| a | 53786 | 7.5% |
| r | 50056 | 7.0% |
| n | 39775 | 5.6% |
| t | 39328 | 5.5% |
| o | 38788 | 5.4% |
| i | 37562 | 5.3% |
| d | 25170 | 3.5% |
| s | 23618 | 3.3% |
| Other values (131) | 254621 |
contact
Text
Missing 
| Distinct | 72593 |
|---|---|
| Distinct (%) | 82.2% |
| Missing | 524634 |
| Missing (%) | 85.6% |
| Memory size | 26.6 MiB |
Length
| Max length | 183 |
|---|---|
| Median length | 43 |
| Mean length | 12.132992 |
| Min length | 2 |
Unique
| Unique | 64152 ? |
|---|---|
| Unique (%) | 72.7% |
Sample
| 1st row | Jim Sullivan |
|---|---|
| 2nd row | Dean Fernandes |
| 3rd row | Darren Leigh |
| 4th row | Criss Edwards |
| 5th row | Bryan Jackson CBE |
| Value | Count | Frequency (%) |
| david | 1438 | 0.8% |
| john | 1317 | 0.8% |
| michael | 1068 | 0.6% |
| mark | 868 | 0.5% |
| chris | 754 | 0.4% |
| paul | 752 | 0.4% |
| james | 735 | 0.4% |
| andrew | 686 | 0.4% |
| scott | 647 | 0.4% |
| mike | 620 | 0.4% |
| Other values (45997) | 160490 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 102373 | 9.6% |
| e | 94889 | 8.9% |
| 81082 | 7.6% | |
| n | 75818 | 7.1% |
| r | 69718 | 6.5% |
| i | 67637 | 6.3% |
| o | 55481 | 5.2% |
| l | 49480 | 4.6% |
| t | 37775 | 3.5% |
| s | 37525 | 3.5% |
| Other values (156) | 399274 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1071052 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 102373 | 9.6% |
| e | 94889 | 8.9% |
| 81082 | 7.6% | |
| n | 75818 | 7.1% |
| r | 69718 | 6.5% |
| i | 67637 | 6.3% |
| o | 55481 | 5.2% |
| l | 49480 | 4.6% |
| t | 37775 | 3.5% |
| s | 37525 | 3.5% |
| Other values (156) | 399274 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1071052 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 102373 | 9.6% |
| e | 94889 | 8.9% |
| 81082 | 7.6% | |
| n | 75818 | 7.1% |
| r | 69718 | 6.5% |
| i | 67637 | 6.3% |
| o | 55481 | 5.2% |
| l | 49480 | 4.6% |
| t | 37775 | 3.5% |
| s | 37525 | 3.5% |
| Other values (156) | 399274 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1071052 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 102373 | 9.6% |
| e | 94889 | 8.9% |
| 81082 | 7.6% | |
| n | 75818 | 7.1% |
| r | 69718 | 6.5% |
| i | 67637 | 6.3% |
| o | 55481 | 5.2% |
| l | 49480 | 4.6% |
| t | 37775 | 3.5% |
| s | 37525 | 3.5% |
| Other values (156) | 399274 |
effective_date
Date
| Distinct | 5562 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.4 MiB |
| Minimum | 1916-01-01 00:00:00 |
|---|---|
| Maximum | 2033-01-01 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
event
Text
Missing 
| Distinct | 15277 |
|---|---|
| Distinct (%) | 82.3% |
| Missing | 594343 |
| Missing (%) | 97.0% |
| Memory size | 24.4 MiB |
Length
| Max length | 152 |
|---|---|
| Median length | 105 |
| Mean length | 29.611353 |
| Min length | 3 |
Unique
| Unique | 13438 ? |
|---|---|
| Unique (%) | 72.4% |
Sample
| 1st row | Australian Specialist Hub |
|---|---|
| 2nd row | 2024 Five9 Global Partner Awards |
| 3rd row | Accenture HealthTech Innovation Challenge |
| 4th row | Accenture HealthTech Innovation Challenge |
| 5th row | 2021 Australian Space Awards |
| Value | Count | Frequency (%) |
| awards | 7090 | 8.8% |
| 2025 | 3924 | 4.9% |
| conference | 1717 | 2.1% |
| annual | 1547 | 1.9% |
| 2024 | 1331 | 1.7% |
| and | 961 | 1.2% |
| 918 | 1.1% | |
| world | 908 | 1.1% |
| summit | 815 | 1.0% |
| of | 793 | 1.0% |
| Other values (10822) | 60226 |
Most occurring characters
| Value | Count | Frequency (%) |
| 61609 | 11.2% | |
| e | 39905 | 7.3% |
| a | 37735 | 6.9% |
| n | 35088 | 6.4% |
| r | 30742 | 5.6% |
| i | 27018 | 4.9% |
| o | 26363 | 4.8% |
| s | 24948 | 4.5% |
| t | 23939 | 4.4% |
| l | 17663 | 3.2% |
| Other values (130) | 224784 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 549794 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 61609 | 11.2% | |
| e | 39905 | 7.3% |
| a | 37735 | 6.9% |
| n | 35088 | 6.4% |
| r | 30742 | 5.6% |
| i | 27018 | 4.9% |
| o | 26363 | 4.8% |
| s | 24948 | 4.5% |
| t | 23939 | 4.4% |
| l | 17663 | 3.2% |
| Other values (130) | 224784 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 549794 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 61609 | 11.2% | |
| e | 39905 | 7.3% |
| a | 37735 | 6.9% |
| n | 35088 | 6.4% |
| r | 30742 | 5.6% |
| i | 27018 | 4.9% |
| o | 26363 | 4.8% |
| s | 24948 | 4.5% |
| t | 23939 | 4.4% |
| l | 17663 | 3.2% |
| Other values (130) | 224784 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 549794 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 61609 | 11.2% | |
| e | 39905 | 7.3% |
| a | 37735 | 6.9% |
| n | 35088 | 6.4% |
| r | 30742 | 5.6% |
| i | 27018 | 4.9% |
| o | 26363 | 4.8% |
| s | 24948 | 4.5% |
| t | 23939 | 4.4% |
| l | 17663 | 3.2% |
| Other values (130) | 224784 |
financing_type
Text
Missing 
| Distinct | 1210 |
|---|---|
| Distinct (%) | 11.6% |
| Missing | 602468 |
| Missing (%) | 98.3% |
| Memory size | 23.7 MiB |
Length
| Max length | 58 |
|---|---|
| Median length | 51 |
| Mean length | 10.522122 |
| Min length | 3 |
Unique
| Unique | 893 ? |
|---|---|
| Unique (%) | 8.6% |
Sample
| 1st row | donations |
|---|---|
| 2nd row | grant funding |
| 3rd row | grant |
| 4th row | grant |
| 5th row | grant |
| Value | Count | Frequency (%) |
| funding | 2303 | 12.8% |
| series | 1945 | 10.8% |
| grant | 1837 | 10.2% |
| public | 842 | 4.7% |
| offering | 811 | 4.5% |
| ipo | 744 | 4.1% |
| initial | 712 | 4.0% |
| a | 682 | 3.8% |
| round | 610 | 3.4% |
| seed | 516 | 2.9% |
| Other values (748) | 7006 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 13650 | |
| i | 11662 | 10.6% |
| e | 10033 | 9.1% |
| 7566 | 6.9% | |
| r | 7278 | 6.6% |
| t | 6570 | 6.0% |
| g | 6283 | 5.7% |
| a | 5490 | 5.0% |
| f | 5437 | 4.9% |
| d | 4841 | 4.4% |
| Other values (63) | 31062 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 109872 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 13650 | |
| i | 11662 | 10.6% |
| e | 10033 | 9.1% |
| 7566 | 6.9% | |
| r | 7278 | 6.6% |
| t | 6570 | 6.0% |
| g | 6283 | 5.7% |
| a | 5490 | 5.0% |
| f | 5437 | 4.9% |
| d | 4841 | 4.4% |
| Other values (63) | 31062 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 109872 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 13650 | |
| i | 11662 | 10.6% |
| e | 10033 | 9.1% |
| 7566 | 6.9% | |
| r | 7278 | 6.6% |
| t | 6570 | 6.0% |
| g | 6283 | 5.7% |
| a | 5490 | 5.0% |
| f | 5437 | 4.9% |
| d | 4841 | 4.4% |
| Other values (63) | 31062 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 109872 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 13650 | |
| i | 11662 | 10.6% |
| e | 10033 | 9.1% |
| 7566 | 6.9% | |
| r | 7278 | 6.6% |
| t | 6570 | 6.0% |
| g | 6283 | 5.7% |
| a | 5490 | 5.0% |
| f | 5437 | 4.9% |
| d | 4841 | 4.4% |
| Other values (63) | 31062 |
financing_type_normalized
Categorical
High correlation  Missing 
| Distinct | 28 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 610462 |
| Missing (%) | 99.6% |
| Memory size | 37.4 MiB |
| series_a | |
|---|---|
| series_b | |
| seed | |
| series_c | |
| series_d | |
| Other values (23) |
Length
| Max length | 12 |
|---|---|
| Median length | 8 |
| Mean length | 7.3419118 |
| Min length | 4 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | seed |
|---|---|
| 2nd row | seed |
| 3rd row | series_e |
| 4th row | series_b |
| 5th row | series_b |
Common Values
| Value | Count | Frequency (%) |
| series_a | 635 | 0.1% |
| series_b | 503 | 0.1% |
| seed | 453 | 0.1% |
| series_c | 331 | 0.1% |
| series_d | 184 | < 0.1% |
| series_e | 88 | < 0.1% |
| pre_seed | 62 | < 0.1% |
| series_f | 61 | < 0.1% |
| pre_series_a | 39 | < 0.1% |
| series_g | 20 | < 0.1% |
| Other values (18) | 72 | < 0.1% |
| (Missing) | 610462 |
Length
| Value | Count | Frequency (%) |
| series_a | 635 | |
| series_b | 503 | |
| seed | 453 | |
| series_c | 331 | |
| series_d | 184 | 7.5% |
| series_e | 88 | 3.6% |
| pre_seed | 62 | 2.5% |
| series_f | 61 | 2.5% |
| pre_series_a | 39 | 1.6% |
| series_g | 20 | 0.8% |
| Other values (18) | 72 | 2.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5090 | |
| s | 4360 | |
| r | 2034 | 11.3% |
| _ | 2034 | 11.3% |
| i | 1923 | 10.7% |
| d | 706 | 3.9% |
| a | 697 | 3.9% |
| b | 516 | 2.9% |
| c | 340 | 1.9% |
| p | 112 | 0.6% |
| Other values (9) | 161 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 17973 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 5090 | |
| s | 4360 | |
| r | 2034 | 11.3% |
| _ | 2034 | 11.3% |
| i | 1923 | 10.7% |
| d | 706 | 3.9% |
| a | 697 | 3.9% |
| b | 516 | 2.9% |
| c | 340 | 1.9% |
| p | 112 | 0.6% |
| Other values (9) | 161 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 17973 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 5090 | |
| s | 4360 | |
| r | 2034 | 11.3% |
| _ | 2034 | 11.3% |
| i | 1923 | 10.7% |
| d | 706 | 3.9% |
| a | 697 | 3.9% |
| b | 516 | 2.9% |
| c | 340 | 1.9% |
| p | 112 | 0.6% |
| Other values (9) | 161 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 17973 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 5090 | |
| s | 4360 | |
| r | 2034 | 11.3% |
| _ | 2034 | 11.3% |
| i | 1923 | 10.7% |
| d | 706 | 3.9% |
| a | 697 | 3.9% |
| b | 516 | 2.9% |
| c | 340 | 1.9% |
| p | 112 | 0.6% |
| Other values (9) | 161 | 0.9% |
financing_type_tags
Categorical
Imbalance 
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 MiB |
| equity | 12653 |
|---|---|
| grant | 5684 |
| ipo | 1298 |
| donation | 836 |
| Other values (29) | 2777 |
Length
| Max length | 26 |
|---|---|
| Median length | 0 |
| Mean length | 0.23838247 |
| Min length | 0 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 589662 | ||
| equity | 12653 | 2.1% |
| grant | 5684 | 0.9% |
| ipo | 1298 | 0.2% |
| donation | 836 | 0.1% |
| debt | 819 | 0.1% |
| equity, grant | 381 | 0.1% |
| equity, series_a | 357 | 0.1% |
| equity, series_b | 273 | < 0.1% |
| seed, equity | 239 | < 0.1% |
| Other values (24) | 708 | 0.1% |
Length
| Value | Count | Frequency (%) |
| equity | 14591 | |
| grant | 6099 | |
| ipo | 1314 | 5.2% |
| debt | 916 | 3.6% |
| donation | 843 | 3.3% |
| seed | 462 | 1.8% |
| series_a | 360 | 1.4% |
| series_b | 274 | 1.1% |
| series_c | 164 | 0.7% |
| series_d | 92 | 0.4% |
| Other values (7) | 111 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 22449 | |
| e | 18438 | |
| i | 17720 | |
| u | 14591 | |
| q | 14591 | |
| y | 14591 | |
| n | 7817 | 5.4% |
| a | 7334 | 5.0% |
| r | 7068 | 4.8% |
| g | 6134 | 4.2% |
| Other values (13) | 15374 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 146107 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 22449 | |
| e | 18438 | |
| i | 17720 | |
| u | 14591 | |
| q | 14591 | |
| y | 14591 | |
| n | 7817 | 5.4% |
| a | 7334 | 5.0% |
| r | 7068 | 4.8% |
| g | 6134 | 4.2% |
| Other values (13) | 15374 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 146107 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 22449 | |
| e | 18438 | |
| i | 17720 | |
| u | 14591 | |
| q | 14591 | |
| y | 14591 | |
| n | 7817 | 5.4% |
| a | 7334 | 5.0% |
| r | 7068 | 4.8% |
| g | 6134 | 4.2% |
| Other values (13) | 15374 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 146107 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 22449 | |
| e | 18438 | |
| i | 17720 | |
| u | 14591 | |
| q | 14591 | |
| y | 14591 | |
| n | 7817 | 5.4% |
| a | 7334 | 5.0% |
| r | 7068 | 4.8% |
| g | 6134 | 4.2% |
| Other values (13) | 15374 |
headcount
Real number (ℝ)
High correlation  Skewed  Zeros 
| Distinct | 698 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 99.549999 |
| Minimum | -10 |
|---|---|
| Maximum | 1750000 |
| Zeros | 606682 |
| Zeros (%) | 99.0% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 9.4 MiB |
Quantile statistics
| Minimum | -10 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1750000 |
| Range | 1750010 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 5958.167 |
|---|---|
| Coefficient of variation (CV) | 59.851 |
| Kurtosis | 20898.392 |
| Mean | 99.549999 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 120.24378 |
| Sum | 61015190 |
| Variance | 35499754 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 606682 | |
| 100 | 408 | 0.1% |
| 200 | 270 | < 0.1% |
| 50 | 213 | < 0.1% |
| 1000 | 211 | < 0.1% |
| 300 | 203 | < 0.1% |
| 500 | 191 | < 0.1% |
| 400 | 171 | < 0.1% |
| 150 | 136 | < 0.1% |
| 250 | 111 | < 0.1% |
| Other values (688) | 4314 | 0.7% |
| Value | Count | Frequency (%) |
| -10 | 1 | < 0.1% |
| 0 | 606682 | |
| 1 | 6 | < 0.1% |
| 2 | 6 | < 0.1% |
| 3 | 5 | < 0.1% |
| 4 | 7 | < 0.1% |
| 5 | 5 | < 0.1% |
| 6 | 3 | < 0.1% |
| 7 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1750000 | 1 | < 0.1% |
| 1000000 | 1 | < 0.1% |
| 950000 | 1 | < 0.1% |
| 800000 | 3 | |
| 780000 | 1 | < 0.1% |
| 750000 | 1 | < 0.1% |
| 742000 | 1 | < 0.1% |
| 700000 | 1 | < 0.1% |
| 670000 | 1 | < 0.1% |
| 665000 | 1 | < 0.1% |
job_title
Text
Missing 
| Distinct | 33591 |
|---|---|
| Distinct (%) | 47.2% |
| Missing | 541772 |
| Missing (%) | 88.4% |
| Memory size | 27.1 MiB |
Length
| Max length | 212 |
|---|---|
| Median length | 144 |
| Mean length | 27.304928 |
| Min length | 2 |
Unique
| Unique | 28819 ? |
|---|---|
| Unique (%) | 40.5% |
Sample
| 1st row | CEO and chairman |
|---|---|
| 2nd row | Chief Technology Officer |
| 3rd row | Unipart Group Executive Chairman |
| 4th row | Managing Director for North America |
| 5th row | Non-Executive Chair |
| Value | Count | Frequency (%) |
| of | 19960 | 7.3% |
| and | 13887 | 5.0% |
| chief | 12854 | 4.7% |
| director | 12602 | 4.6% |
| president | 12441 | 4.5% |
| officer | 11366 | 4.1% |
| vice | 8288 | 3.0% |
| executive | 6849 | 2.5% |
| head | 6389 | 2.3% |
| ceo | 5150 | 1.9% |
| Other values (8129) | 165342 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 219331 | 11.3% |
| 203959 | 10.5% | |
| i | 160195 | 8.2% |
| r | 144525 | 7.4% |
| n | 128951 | 6.6% |
| a | 127942 | 6.6% |
| o | 114724 | 5.9% |
| t | 110419 | 5.7% |
| c | 93579 | 4.8% |
| s | 75885 | 3.9% |
| Other values (101) | 562908 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1942418 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 219331 | 11.3% |
| 203959 | 10.5% | |
| i | 160195 | 8.2% |
| r | 144525 | 7.4% |
| n | 128951 | 6.6% |
| a | 127942 | 6.6% |
| o | 114724 | 5.9% |
| t | 110419 | 5.7% |
| c | 93579 | 4.8% |
| s | 75885 | 3.9% |
| Other values (101) | 562908 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1942418 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 219331 | 11.3% |
| 203959 | 10.5% | |
| i | 160195 | 8.2% |
| r | 144525 | 7.4% |
| n | 128951 | 6.6% |
| a | 127942 | 6.6% |
| o | 114724 | 5.9% |
| t | 110419 | 5.7% |
| c | 93579 | 4.8% |
| s | 75885 | 3.9% |
| Other values (101) | 562908 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1942418 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 219331 | 11.3% |
| 203959 | 10.5% | |
| i | 160195 | 8.2% |
| r | 144525 | 7.4% |
| n | 128951 | 6.6% |
| a | 127942 | 6.6% |
| o | 114724 | 5.9% |
| t | 110419 | 5.7% |
| c | 93579 | 4.8% |
| s | 75885 | 3.9% |
| Other values (101) | 562908 |
job_title_tags
Text
| Distinct | 1039 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 40.9 MiB |
Length
| Max length | 124 |
|---|---|
| Median length | 0 |
| Mean length | 3.321672 |
| Min length | 0 |
Unique
| Unique | 379 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row | support |
| Value | Count | Frequency (%) |
| directors | 35477 | |
| general_technology | 21288 | |
| information_technology | 14300 | |
| education | 14183 | 8.4% |
| marketing | 13893 | 8.2% |
| support | 11366 | 6.7% |
| management | 11107 | 6.6% |
| finance | 10673 | 6.3% |
| engineering | 7584 | 4.5% |
| sales | 5063 | 3.0% |
| Other values (9) | 24115 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 222059 | |
| n | 204089 | 10.0% |
| o | 184751 | 9.1% |
| t | 166490 | 8.2% |
| r | 159839 | 7.9% |
| i | 142536 | 7.0% |
| a | 138467 | 6.8% |
| g | 101070 | 5.0% |
| c | 100714 | 4.9% |
| s | 88475 | 4.3% |
| Other values (14) | 527396 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2035886 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 222059 | |
| n | 204089 | 10.0% |
| o | 184751 | 9.1% |
| t | 166490 | 8.2% |
| r | 159839 | 7.9% |
| i | 142536 | 7.0% |
| a | 138467 | 6.8% |
| g | 101070 | 5.0% |
| c | 100714 | 4.9% |
| s | 88475 | 4.3% |
| Other values (14) | 527396 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2035886 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 222059 | |
| n | 204089 | 10.0% |
| o | 184751 | 9.1% |
| t | 166490 | 8.2% |
| r | 159839 | 7.9% |
| i | 142536 | 7.0% |
| a | 138467 | 6.8% |
| g | 101070 | 5.0% |
| c | 100714 | 4.9% |
| s | 88475 | 4.3% |
| Other values (14) | 527396 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2035886 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 222059 | |
| n | 204089 | 10.0% |
| o | 184751 | 9.1% |
| t | 166490 | 8.2% |
| r | 159839 | 7.9% |
| i | 142536 | 7.0% |
| a | 138467 | 6.8% |
| g | 101070 | 5.0% |
| c | 100714 | 4.9% |
| s | 88475 | 4.3% |
| Other values (14) | 527396 |
location
Text
Missing 
| Distinct | 13642 |
|---|---|
| Distinct (%) | 8.1% |
| Missing | 444747 |
| Missing (%) | 72.6% |
| Memory size | 30.5 MiB |
Length
| Max length | 89 |
|---|---|
| Median length | 55 |
| Mean length | 19.30075 |
| Min length | 2 |
Unique
| Unique | 8228 ? |
|---|---|
| Unique (%) | 4.9% |
Sample
| 1st row | United Kingdom |
|---|---|
| 2nd row | Hungary |
| 3rd row | Sydney, Australia |
| 4th row | Boston, Massachusetts, United States |
| 5th row | Boston, Massachusetts, United States |
| Value | Count | Frequency (%) |
| united | 83681 | 18.9% |
| states | 67540 | 15.3% |
| kingdom | 14321 | 3.2% |
| new | 13826 | 3.1% |
| australia | 12583 | 2.8% |
| india | 10237 | 2.3% |
| california | 7952 | 1.8% |
| york | 7531 | 1.7% |
| texas | 4959 | 1.1% |
| canada | 4610 | 1.0% |
| Other values (10402) | 215031 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 307773 | 9.5% |
| t | 291749 | 9.0% |
| e | 277950 | 8.6% |
| 274104 | 8.4% | |
| i | 252300 | 7.8% |
| n | 243016 | 7.5% |
| s | 155516 | 4.8% |
| d | 150219 | 4.6% |
| , | 141693 | 4.4% |
| o | 124331 | 3.8% |
| Other values (158) | 1027021 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3245672 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 307773 | 9.5% |
| t | 291749 | 9.0% |
| e | 277950 | 8.6% |
| 274104 | 8.4% | |
| i | 252300 | 7.8% |
| n | 243016 | 7.5% |
| s | 155516 | 4.8% |
| d | 150219 | 4.6% |
| , | 141693 | 4.4% |
| o | 124331 | 3.8% |
| Other values (158) | 1027021 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3245672 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 307773 | 9.5% |
| t | 291749 | 9.0% |
| e | 277950 | 8.6% |
| 274104 | 8.4% | |
| i | 252300 | 7.8% |
| n | 243016 | 7.5% |
| s | 155516 | 4.8% |
| d | 150219 | 4.6% |
| , | 141693 | 4.4% |
| o | 124331 | 3.8% |
| Other values (158) | 1027021 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3245672 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 307773 | 9.5% |
| t | 291749 | 9.0% |
| e | 277950 | 8.6% |
| 274104 | 8.4% | |
| i | 252300 | 7.8% |
| n | 243016 | 7.5% |
| s | 155516 | 4.8% |
| d | 150219 | 4.6% |
| , | 141693 | 4.4% |
| o | 124331 | 3.8% |
| Other values (158) | 1027021 |
product
Text
Missing 
| Distinct | 203976 |
|---|---|
| Distinct (%) | 92.4% |
| Missing | 392072 |
| Missing (%) | 64.0% |
| Memory size | 36.7 MiB |
Length
| Max length | 332 |
|---|---|
| Median length | 198 |
| Mean length | 33.377757 |
| Min length | 1 |
Unique
| Unique | 194141 ? |
|---|---|
| Unique (%) | 87.9% |
Sample
| 1st row | Mobiliti app |
|---|---|
| 2nd row | two new kits, At-Home Essentials and Office Collaboration Room-as-a-Service |
| 3rd row | Model 3800 |
| 4th row | Share Purchase Plan |
| 5th row | major assay program at Minbrie |
| Value | Count | Frequency (%) |
| for | 29562 | 2.7% |
| of | 26489 | 2.4% |
| the | 26311 | 2.4% |
| and | 22056 | 2.0% |
| to | 15337 | 1.4% |
| in | 12190 | 1.1% |
| program | 7900 | 0.7% |
| on | 7706 | 0.7% |
| new | 7108 | 0.7% |
| series | 7032 | 0.6% |
| Other values (95507) | 924505 |
Most occurring characters
| Value | Count | Frequency (%) |
| 865914 | 11.7% | |
| e | 672364 | 9.1% |
| a | 472125 | 6.4% |
| o | 469662 | 6.4% |
| i | 466851 | 6.3% |
| t | 449310 | 6.1% |
| r | 444828 | 6.0% |
| n | 414583 | 5.6% |
| s | 341892 | 4.6% |
| l | 265735 | 3.6% |
| Other values (420) | 2507813 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7371077 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 865914 | 11.7% | |
| e | 672364 | 9.1% |
| a | 472125 | 6.4% |
| o | 469662 | 6.4% |
| i | 466851 | 6.3% |
| t | 449310 | 6.1% |
| r | 444828 | 6.0% |
| n | 414583 | 5.6% |
| s | 341892 | 4.6% |
| l | 265735 | 3.6% |
| Other values (420) | 2507813 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7371077 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 865914 | 11.7% | |
| e | 672364 | 9.1% |
| a | 472125 | 6.4% |
| o | 469662 | 6.4% |
| i | 466851 | 6.3% |
| t | 449310 | 6.1% |
| r | 444828 | 6.0% |
| n | 414583 | 5.6% |
| s | 341892 | 4.6% |
| l | 265735 | 3.6% |
| Other values (420) | 2507813 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7371077 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 865914 | 11.7% | |
| e | 672364 | 9.1% |
| a | 472125 | 6.4% |
| o | 469662 | 6.4% |
| i | 466851 | 6.3% |
| t | 449310 | 6.1% |
| r | 444828 | 6.0% |
| n | 414583 | 5.6% |
| s | 341892 | 4.6% |
| l | 265735 | 3.6% |
| Other values (420) | 2507813 |
Missing 
| Distinct | 205544 |
|---|---|
| Distinct (%) | 93.1% |
| Missing | 392134 |
| Missing (%) | 64.0% |
| Memory size | 37.2 MiB |
Length
| Max length | 332 |
|---|---|
| Median length | 203 |
| Mean length | 35.405116 |
| Min length | 2 |
Unique
| Unique | 196702 ? |
|---|---|
| Unique (%) | 89.1% |
Sample
| 1st row | Mobiliti app |
|---|---|
| 2nd row | two new kits, At-Home Essentials and Office Collaboration Room-as-a-Service |
| 3rd row | Model 3800 |
| 4th row | Share Purchase Plan |
| 5th row | major assay program at Minbrie |
| Value | Count | Frequency (%) |
| for | 30912 | 2.7% |
| the | 28365 | 2.5% |
| of | 27854 | 2.4% |
| and | 22906 | 2.0% |
| to | 15893 | 1.4% |
| in | 12532 | 1.1% |
| a | 8867 | 0.8% |
| program | 8464 | 0.7% |
| new | 8135 | 0.7% |
| on | 8030 | 0.7% |
| Other values (97436) | 976985 |
Most occurring characters
| Value | Count | Frequency (%) |
| 928574 | 11.9% | |
| e | 715179 | 9.1% |
| a | 504107 | 6.4% |
| o | 495616 | 6.3% |
| i | 493741 | 6.3% |
| t | 478633 | 6.1% |
| r | 470119 | 6.0% |
| n | 437772 | 5.6% |
| s | 360718 | 4.6% |
| l | 287397 | 3.7% |
| Other values (423) | 2644744 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7816600 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 928574 | 11.9% | |
| e | 715179 | 9.1% |
| a | 504107 | 6.4% |
| o | 495616 | 6.3% |
| i | 493741 | 6.3% |
| t | 478633 | 6.1% |
| r | 470119 | 6.0% |
| n | 437772 | 5.6% |
| s | 360718 | 4.6% |
| l | 287397 | 3.7% |
| Other values (423) | 2644744 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7816600 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 928574 | 11.9% | |
| e | 715179 | 9.1% |
| a | 504107 | 6.4% |
| o | 495616 | 6.3% |
| i | 493741 | 6.3% |
| t | 478633 | 6.1% |
| r | 470119 | 6.0% |
| n | 437772 | 5.6% |
| s | 360718 | 4.6% |
| l | 287397 | 3.7% |
| Other values (423) | 2644744 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7816600 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 928574 | 11.9% | |
| e | 715179 | 9.1% |
| a | 504107 | 6.4% |
| o | 495616 | 6.3% |
| i | 493741 | 6.3% |
| t | 478633 | 6.1% |
| r | 470119 | 6.0% |
| n | 437772 | 5.6% |
| s | 360718 | 4.6% |
| l | 287397 | 3.7% |
| Other values (423) | 2644744 |
Missing 
| Distinct | 14025 |
|---|---|
| Distinct (%) | 92.8% |
| Missing | 597801 |
| Missing (%) | 97.5% |
| Memory size | 24.0 MiB |
Length
| Max length | 185 |
|---|---|
| Median length | 105 |
| Mean length | 15.293137 |
| Min length | 1 |
Unique
| Unique | 13209 ? |
|---|---|
| Unique (%) | 87.4% |
Sample
| 1st row | Stronger Through MentHERship |
|---|---|
| 2nd row | Fieldbook |
| 3rd row | Ask the Doctor |
| 4th row | Unipart Signite |
| 5th row | SUVs |
| Value | Count | Frequency (%) |
| the | 1387 | 3.8% |
| of | 413 | 1.1% |
| for | 398 | 1.1% |
| market | 329 | 0.9% |
| and | 307 | 0.8% |
| in | 214 | 0.6% |
| to | 213 | 0.6% |
| a | 181 | 0.5% |
| on | 124 | 0.3% |
| ai | 109 | 0.3% |
| Other values (14719) | 32516 |
Most occurring characters
| Value | Count | Frequency (%) |
| 21205 | 9.2% | |
| e | 20608 | 8.9% |
| a | 14434 | 6.2% |
| i | 13116 | 5.7% |
| o | 12999 | 5.6% |
| r | 12683 | 5.5% |
| t | 12347 | 5.3% |
| n | 11407 | 4.9% |
| s | 8614 | 3.7% |
| l | 7652 | 3.3% |
| Other values (92) | 95999 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 231064 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 21205 | 9.2% | |
| e | 20608 | 8.9% |
| a | 14434 | 6.2% |
| i | 13116 | 5.7% |
| o | 12999 | 5.6% |
| r | 12683 | 5.5% |
| t | 12347 | 5.3% |
| n | 11407 | 4.9% |
| s | 8614 | 3.7% |
| l | 7652 | 3.3% |
| Other values (92) | 95999 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 231064 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 21205 | 9.2% | |
| e | 20608 | 8.9% |
| a | 14434 | 6.2% |
| i | 13116 | 5.7% |
| o | 12999 | 5.6% |
| r | 12683 | 5.5% |
| t | 12347 | 5.3% |
| n | 11407 | 4.9% |
| s | 8614 | 3.7% |
| l | 7652 | 3.3% |
| Other values (92) | 95999 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 231064 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 21205 | 9.2% | |
| e | 20608 | 8.9% |
| a | 14434 | 6.2% |
| i | 13116 | 5.7% |
| o | 12999 | 5.6% |
| r | 12683 | 5.5% |
| t | 12347 | 5.3% |
| n | 11407 | 4.9% |
| s | 8614 | 3.7% |
| l | 7652 | 3.3% |
| Other values (92) | 95999 |
Missing 
| Distinct | 539 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 586093 |
| Missing (%) | 95.6% |
| Memory size | 24.2 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 20 |
| Mean length | 7.7260693 |
| Min length | 4 |
Unique
| Unique | 343 ? |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | features |
|---|---|
| 2nd row | models |
| 3rd row | section |
| 4th row | collection |
| 5th row | feature |
| Value | Count | Frequency (%) |
| version | 4471 | |
| line | 2664 | 8.8% |
| feature | 2272 | 7.5% |
| model | 2227 | 7.3% |
| update | 2066 | 6.8% |
| edition | 1970 | 6.5% |
| generation | 1934 | 6.4% |
| features | 1485 | 4.9% |
| collection | 1353 | 4.4% |
| list | 1278 | 4.2% |
| Other values (360) | 8703 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 35556 | |
| i | 18335 | |
| n | 18315 | |
| o | 16438 | 7.9% |
| t | 16354 | 7.9% |
| r | 13005 | 6.3% |
| s | 12969 | 6.3% |
| a | 12324 | 5.9% |
| l | 11645 | 5.6% |
| d | 11000 | 5.3% |
| Other values (25) | 41249 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 207190 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 35556 | |
| i | 18335 | |
| n | 18315 | |
| o | 16438 | 7.9% |
| t | 16354 | 7.9% |
| r | 13005 | 6.3% |
| s | 12969 | 6.3% |
| a | 12324 | 5.9% |
| l | 11645 | 5.6% |
| d | 11000 | 5.3% |
| Other values (25) | 41249 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 207190 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 35556 | |
| i | 18335 | |
| n | 18315 | |
| o | 16438 | 7.9% |
| t | 16354 | 7.9% |
| r | 13005 | 6.3% |
| s | 12969 | 6.3% |
| a | 12324 | 5.9% |
| l | 11645 | 5.6% |
| d | 11000 | 5.3% |
| Other values (25) | 41249 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 207190 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 35556 | |
| i | 18335 | |
| n | 18315 | |
| o | 16438 | 7.9% |
| t | 16354 | 7.9% |
| r | 13005 | 6.3% |
| s | 12969 | 6.3% |
| a | 12324 | 5.9% |
| l | 11645 | 5.6% |
| d | 11000 | 5.3% |
| Other values (25) | 41249 |
product_data.release_version
Text
Missing 
| Distinct | 326 |
|---|---|
| Distinct (%) | 51.8% |
| Missing | 612281 |
| Missing (%) | 99.9% |
| Memory size | 23.4 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 11 |
| Mean length | 3.5532591 |
| Min length | 1 |
Unique
| Unique | 238 ? |
|---|---|
| Unique (%) | 37.8% |
Sample
| 1st row | 4.00 |
|---|---|
| 2nd row | 18.3 |
| 3rd row | 1.57.0 |
| 4th row | 1.0 |
| 5th row | 3.0 |
| Value | Count | Frequency (%) |
| 2.0 | 45 | 7.2% |
| 3.0 | 23 | 3.7% |
| 1.0 | 16 | 2.5% |
| 5 | 15 | 2.4% |
| 2 | 13 | 2.1% |
| 2022 | 12 | 1.9% |
| 1.1 | 10 | 1.6% |
| 9 | 10 | 1.6% |
| 3 | 10 | 1.6% |
| 2020 | 9 | 1.4% |
| Other values (315) | 466 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 585 | |
| 2 | 340 | |
| 1 | 330 | |
| 0 | 327 | |
| 3 | 138 | 6.2% |
| 4 | 121 | 5.4% |
| 5 | 114 | 5.1% |
| 6 | 75 | 3.4% |
| 7 | 70 | 3.1% |
| 8 | 68 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2235 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 585 | |
| 2 | 340 | |
| 1 | 330 | |
| 0 | 327 | |
| 3 | 138 | 6.2% |
| 4 | 121 | 5.4% |
| 5 | 114 | 5.1% |
| 6 | 75 | 3.4% |
| 7 | 70 | 3.1% |
| 8 | 68 | 3.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2235 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 585 | |
| 2 | 340 | |
| 1 | 330 | |
| 0 | 327 | |
| 3 | 138 | 6.2% |
| 4 | 121 | 5.4% |
| 5 | 114 | 5.1% |
| 6 | 75 | 3.4% |
| 7 | 70 | 3.1% |
| 8 | 68 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2235 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 585 | |
| 2 | 340 | |
| 1 | 330 | |
| 0 | 327 | |
| 3 | 138 | 6.2% |
| 4 | 121 | 5.4% |
| 5 | 114 | 5.1% |
| 6 | 75 | 3.4% |
| 7 | 70 | 3.1% |
| 8 | 68 | 3.0% |
product_data.fuzzy_match
Boolean
High correlation  Imbalance  Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 392134 |
| Missing (%) | 64.0% |
| Memory size | 24.2 MiB |
| True | |
|---|---|
| False | 7042 |
| (Missing) |
| Value | Count | Frequency (%) |
| True | 213734 | |
| False | 7042 | 1.1% |
| (Missing) | 392134 |
product_tags
Text
| Distinct | 580 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 40.4 MiB |
Length
| Max length | 83 |
|---|---|
| Median length | 0 |
| Mean length | 2.3865543 |
| Min length | 0 |
Unique
| Unique | 189 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | mobile, online_technology |
| 5th row |
| Value | Count | Frequency (%) |
| online_technology | 25132 | |
| general_technology | 21288 | |
| marketing | 13893 | |
| campaigns | 12817 | |
| mobile | 11476 | |
| future_tech | 8967 | 7.4% |
| report | 6872 | 5.6% |
| video | 5093 | 4.2% |
| data | 3199 | 2.6% |
| games | 3089 | 2.5% |
| Other values (8) | 9979 | 8.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 183440 | |
| n | 148690 | 10.2% |
| o | 143507 | 9.8% |
| l | 107660 | 7.4% |
| g | 97668 | 6.7% |
| t | 96057 | 6.6% |
| a | 77951 | 5.3% |
| i | 77554 | 5.3% |
| c | 73911 | 5.1% |
| r | 63525 | 4.3% |
| Other values (14) | 392780 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1462743 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 183440 | |
| n | 148690 | 10.2% |
| o | 143507 | 9.8% |
| l | 107660 | 7.4% |
| g | 97668 | 6.7% |
| t | 96057 | 6.6% |
| a | 77951 | 5.3% |
| i | 77554 | 5.3% |
| c | 73911 | 5.1% |
| r | 63525 | 4.3% |
| Other values (14) | 392780 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1462743 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 183440 | |
| n | 148690 | 10.2% |
| o | 143507 | 9.8% |
| l | 107660 | 7.4% |
| g | 97668 | 6.7% |
| t | 96057 | 6.6% |
| a | 77951 | 5.3% |
| i | 77554 | 5.3% |
| c | 73911 | 5.1% |
| r | 63525 | 4.3% |
| Other values (14) | 392780 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1462743 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 183440 | |
| n | 148690 | 10.2% |
| o | 143507 | 9.8% |
| l | 107660 | 7.4% |
| g | 97668 | 6.7% |
| t | 96057 | 6.6% |
| a | 77951 | 5.3% |
| i | 77554 | 5.3% |
| c | 73911 | 5.1% |
| r | 63525 | 4.3% |
| Other values (14) | 392780 |
recognition
Text
Missing 
| Distinct | 22237 |
|---|---|
| Distinct (%) | 88.6% |
| Missing | 587807 |
| Missing (%) | 95.9% |
| Memory size | 25.5 MiB |
Length
| Max length | 429 |
|---|---|
| Median length | 204 |
| Mean length | 45.497789 |
| Min length | 2 |
Unique
| Unique | 20959 ? |
|---|---|
| Unique (%) | 83.5% |
Sample
| 1st row | Transport and Storage sector winner |
|---|---|
| 2nd row | Best Place to Work |
| 3rd row | top 50 highest growth companies in Massachusetts |
| 4th row | 2025 Partner of the Year for Community Impact |
| 5th row | one of two APAC region finalists |
| Value | Count | Frequency (%) |
| the | 13966 | 7.5% |
| in | 11211 | 6.0% |
| of | 8982 | 4.8% |
| best | 6681 | 3.6% |
| for | 6178 | 3.3% |
| top | 5483 | 2.9% |
| one | 3118 | 1.7% |
| year | 3107 | 1.7% |
| and | 2613 | 1.4% |
| leader | 1797 | 1.0% |
| Other values (12032) | 124077 |
Most occurring characters
| Value | Count | Frequency (%) |
| 162056 | ||
| e | 103455 | 9.1% |
| o | 77006 | 6.7% |
| t | 76476 | 6.7% |
| n | 69086 | 6.0% |
| r | 67146 | 5.9% |
| i | 66465 | 5.8% |
| a | 64227 | 5.6% |
| s | 54826 | 4.8% |
| l | 33294 | 2.9% |
| Other values (174) | 368094 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1142131 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 162056 | ||
| e | 103455 | 9.1% |
| o | 77006 | 6.7% |
| t | 76476 | 6.7% |
| n | 69086 | 6.0% |
| r | 67146 | 5.9% |
| i | 66465 | 5.8% |
| a | 64227 | 5.6% |
| s | 54826 | 4.8% |
| l | 33294 | 2.9% |
| Other values (174) | 368094 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1142131 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 162056 | ||
| e | 103455 | 9.1% |
| o | 77006 | 6.7% |
| t | 76476 | 6.7% |
| n | 69086 | 6.0% |
| r | 67146 | 5.9% |
| i | 66465 | 5.8% |
| a | 64227 | 5.6% |
| s | 54826 | 4.8% |
| l | 33294 | 2.9% |
| Other values (174) | 368094 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1142131 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 162056 | ||
| e | 103455 | 9.1% |
| o | 77006 | 6.7% |
| t | 76476 | 6.7% |
| n | 69086 | 6.0% |
| r | 67146 | 5.9% |
| i | 66465 | 5.8% |
| a | 64227 | 5.6% |
| s | 54826 | 4.8% |
| l | 33294 | 2.9% |
| Other values (174) | 368094 |
vulnerability
Text
Missing 
| Distinct | 9657 |
|---|---|
| Distinct (%) | 77.2% |
| Missing | 600409 |
| Missing (%) | 98.0% |
| Memory size | 24.2 MiB |
Length
| Max length | 297 |
|---|---|
| Median length | 140 |
| Mean length | 40.645548 |
| Min length | 4 |
Unique
| Unique | 9110 ? |
|---|---|
| Unique (%) | 72.9% |
Sample
| 1st row | disasters and ransomware attacks |
|---|---|
| 2nd row | terrorist attack |
| 3rd row | possible securities law violations |
| 4th row | severe breathing disorder |
| 5th row | cyber attack |
| Value | Count | Frequency (%) |
| of | 4104 | 5.6% |
| and | 1812 | 2.5% |
| the | 1616 | 2.2% |
| to | 1364 | 1.9% |
| breach | 1251 | 1.7% |
| attack | 1038 | 1.4% |
| its | 1024 | 1.4% |
| practices | 872 | 1.2% |
| violations | 823 | 1.1% |
| in | 801 | 1.1% |
| Other values (9203) | 58278 |
Most occurring characters
| Value | Count | Frequency (%) |
| 60471 | ||
| e | 44405 | 8.7% |
| a | 41739 | 8.2% |
| i | 38154 | 7.5% |
| t | 37146 | 7.3% |
| r | 30449 | 6.0% |
| o | 29253 | 5.8% |
| n | 28856 | 5.7% |
| s | 28709 | 5.7% |
| l | 24442 | 4.8% |
| Other values (152) | 144486 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 508110 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 60471 | ||
| e | 44405 | 8.7% |
| a | 41739 | 8.2% |
| i | 38154 | 7.5% |
| t | 37146 | 7.3% |
| r | 30449 | 6.0% |
| o | 29253 | 5.8% |
| n | 28856 | 5.7% |
| s | 28709 | 5.7% |
| l | 24442 | 4.8% |
| Other values (152) | 144486 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 508110 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 60471 | ||
| e | 44405 | 8.7% |
| a | 41739 | 8.2% |
| i | 38154 | 7.5% |
| t | 37146 | 7.3% |
| r | 30449 | 6.0% |
| o | 29253 | 5.8% |
| n | 28856 | 5.7% |
| s | 28709 | 5.7% |
| l | 24442 | 4.8% |
| Other values (152) | 144486 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 508110 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 60471 | ||
| e | 44405 | 8.7% |
| a | 41739 | 8.2% |
| i | 38154 | 7.5% |
| t | 37146 | 7.3% |
| r | 30449 | 6.0% |
| o | 29253 | 5.8% |
| n | 28856 | 5.7% |
| s | 28709 | 5.7% |
| l | 24442 | 4.8% |
| Other values (152) | 144486 |
relationships.company1.data.id
Text
Missing 
| Distinct | 93622 |
|---|---|
| Distinct (%) | 15.7% |
| Missing | 14768 |
| Missing (%) | 2.4% |
| Memory size | 58.2 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 63872 ? |
|---|---|
| Unique (%) | 10.7% |
Sample
| 1st row | 000bd323-1bf8-5c7a-9941-e6c155c29d10 |
|---|---|
| 2nd row | 000ff896-4292-5b15-9c81-8bf4d76c10d7 |
| 3rd row | 000d8a9c-882c-57f2-8b4c-2afc786d0fa1 |
| 4th row | 0008b75f-9d15-54ae-b70a-52301945e397 |
| 5th row | 000d8a9c-882c-57f2-8b4c-2afc786d0fa1 |
| Value | Count | Frequency (%) |
| c9b92ffd-a2ed-5787-8b06-f4bf8d291e57 | 13888 | 2.3% |
| 7b7dbf17-a2ad-54cc-ac66-20995d7f6fba | 9156 | 1.5% |
| c5fbb072-cd84-558e-bec2-83c92923d638 | 3590 | 0.6% |
| d1667f69-ea53-5059-9f52-39fd8eb4696c | 2462 | 0.4% |
| f6054cea-799a-55b6-83f5-a6efd07ce108 | 2441 | 0.4% |
| f0124b95-85b2-53c6-8b39-21f4b3704e31 | 2310 | 0.4% |
| b53ca40e-954c-5aed-8e1c-27533975d94a | 2155 | 0.4% |
| 0a2cb7a4-6a6b-5a3c-b1bc-edbe4dafe698 | 1709 | 0.3% |
| 78217424-aa3a-52b9-8b79-29f3eef99218 | 1577 | 0.3% |
| 02c951c6-a850-5b07-8c80-07e086d6a30f | 1575 | 0.3% |
| Other values (93612) | 557279 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 2392568 | 11.1% |
| 5 | 1679166 | 7.8% |
| b | 1347783 | 6.3% |
| 8 | 1285184 | 6.0% |
| 9 | 1275989 | 5.9% |
| a | 1231024 | 5.7% |
| f | 1188875 | 5.5% |
| 0 | 1186341 | 5.5% |
| d | 1155629 | 5.4% |
| 7 | 1154363 | 5.4% |
| Other values (7) | 7636190 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 21533112 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| - | 2392568 | 11.1% |
| 5 | 1679166 | 7.8% |
| b | 1347783 | 6.3% |
| 8 | 1285184 | 6.0% |
| 9 | 1275989 | 5.9% |
| a | 1231024 | 5.7% |
| f | 1188875 | 5.5% |
| 0 | 1186341 | 5.5% |
| d | 1155629 | 5.4% |
| 7 | 1154363 | 5.4% |
| Other values (7) | 7636190 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 21533112 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| - | 2392568 | 11.1% |
| 5 | 1679166 | 7.8% |
| b | 1347783 | 6.3% |
| 8 | 1285184 | 6.0% |
| 9 | 1275989 | 5.9% |
| a | 1231024 | 5.7% |
| f | 1188875 | 5.5% |
| 0 | 1186341 | 5.5% |
| d | 1155629 | 5.4% |
| 7 | 1154363 | 5.4% |
| Other values (7) | 7636190 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 21533112 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| - | 2392568 | 11.1% |
| 5 | 1679166 | 7.8% |
| b | 1347783 | 6.3% |
| 8 | 1285184 | 6.0% |
| 9 | 1275989 | 5.9% |
| a | 1231024 | 5.7% |
| f | 1188875 | 5.5% |
| 0 | 1186341 | 5.5% |
| d | 1155629 | 5.4% |
| 7 | 1154363 | 5.4% |
| Other values (7) | 7636190 |
relationships.company1.data.type
Categorical
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 14768 |
| Missing (%) | 2.4% |
| Memory size | 42.0 MiB |
| company |
|---|
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | company |
|---|---|
| 2nd row | company |
| 3rd row | company |
| 4th row | company |
| 5th row | company |
Common Values
| Value | Count | Frequency (%) |
| company | 598142 | |
| (Missing) | 14768 | 2.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| company | 598142 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 598142 | |
| o | 598142 | |
| m | 598142 | |
| p | 598142 | |
| a | 598142 | |
| n | 598142 | |
| y | 598142 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4186994 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| c | 598142 | |
| o | 598142 | |
| m | 598142 | |
| p | 598142 | |
| a | 598142 | |
| n | 598142 | |
| y | 598142 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4186994 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| c | 598142 | |
| o | 598142 | |
| m | 598142 | |
| p | 598142 | |
| a | 598142 | |
| n | 598142 | |
| y | 598142 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4186994 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| c | 598142 | |
| o | 598142 | |
| m | 598142 | |
| p | 598142 | |
| a | 598142 | |
| n | 598142 | |
| y | 598142 |
| Distinct | 577277 |
|---|---|
| Distinct (%) | 94.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 545374 ? |
|---|---|
| Unique (%) | 89.0% |
Sample
| 1st row | d172abc1-3755-4cef-946e-7de944806e7d |
|---|---|
| 2nd row | 58c0d5fd-068d-4bab-8ac4-47e19bbdf091 |
| 3rd row | ef330a38-8624-41c1-8b75-d1b96e7dbd45 |
| 4th row | 0525807d-6ff6-44a0-9c36-8be3afceba5b |
| 5th row | 16061c55-111d-496a-9e3e-837dddc3454b |
| Value | Count | Frequency (%) |
| 13d038e6-ecaf-4a2f-8ad7-2fd093f8d090 | 11 | < 0.1% |
| a0711e5b-a9c0-49e4-9744-9003a5749d25 | 10 | < 0.1% |
| d904a811-ac52-4c51-9d06-5d01053fe74d | 9 | < 0.1% |
| db9fd525-b9bb-4702-beae-0e1c49f35458 | 8 | < 0.1% |
| ce230631-885d-4de9-b998-c329373caef2 | 8 | < 0.1% |
| ea5a9ab0-7f09-4b5c-a9b6-3e3d538ddd01 | 8 | < 0.1% |
| 2dfc4cd6-aa00-41fb-9dde-773e92e0385e | 7 | < 0.1% |
| 1a95db5e-bcdf-4c8a-9e5e-16a5572c2449 | 7 | < 0.1% |
| ad3cad1e-7a01-48a8-9ade-8845235dda3a | 7 | < 0.1% |
| 24d07b45-24ea-4a5b-9658-ab2109426c19 | 7 | < 0.1% |
| Other values (577267) | 612828 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 2451640 | 11.1% |
| 4 | 1762099 | 8.0% |
| b | 1303896 | 5.9% |
| 8 | 1303515 | 5.9% |
| a | 1303513 | 5.9% |
| 9 | 1302294 | 5.9% |
| 1 | 1150222 | 5.2% |
| d | 1149956 | 5.2% |
| f | 1149706 | 5.2% |
| e | 1149238 | 5.2% |
| Other values (7) | 8038681 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 22064760 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| - | 2451640 | 11.1% |
| 4 | 1762099 | 8.0% |
| b | 1303896 | 5.9% |
| 8 | 1303515 | 5.9% |
| a | 1303513 | 5.9% |
| 9 | 1302294 | 5.9% |
| 1 | 1150222 | 5.2% |
| d | 1149956 | 5.2% |
| f | 1149706 | 5.2% |
| e | 1149238 | 5.2% |
| Other values (7) | 8038681 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 22064760 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| - | 2451640 | 11.1% |
| 4 | 1762099 | 8.0% |
| b | 1303896 | 5.9% |
| 8 | 1303515 | 5.9% |
| a | 1303513 | 5.9% |
| 9 | 1302294 | 5.9% |
| 1 | 1150222 | 5.2% |
| d | 1149956 | 5.2% |
| f | 1149706 | 5.2% |
| e | 1149238 | 5.2% |
| Other values (7) | 8038681 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 22064760 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| - | 2451640 | 11.1% |
| 4 | 1762099 | 8.0% |
| b | 1303896 | 5.9% |
| 8 | 1303515 | 5.9% |
| a | 1303513 | 5.9% |
| 9 | 1302294 | 5.9% |
| 1 | 1150222 | 5.2% |
| d | 1149956 | 5.2% |
| f | 1149706 | 5.2% |
| e | 1149238 | 5.2% |
| Other values (7) | 8038681 |
relationships.company2.data.id
Text
Missing 
| Distinct | 79877 |
|---|---|
| Distinct (%) | 39.8% |
| Missing | 412313 |
| Missing (%) | 67.3% |
| Memory size | 35.1 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 61065 ? |
|---|---|
| Unique (%) | 30.4% |
Sample
| 1st row | aef53cb0-e89a-516c-88d3-3df6460f2f09 |
|---|---|
| 2nd row | 000ae291-51dd-5d17-bb97-cd0750c7675f |
| 3rd row | 0001407c-15b5-5a59-80e5-dc427b2cb490 |
| 4th row | 47148fcc-136b-5085-bdfc-71609d1a6a35 |
| 5th row | d72cbc23-afe0-55fa-897c-83555e7286e2 |
| Value | Count | Frequency (%) |
| c9b92ffd-a2ed-5787-8b06-f4bf8d291e57 | 3555 | 1.8% |
| d30b0ee6-9c54-575f-8cd5-2b3f222da03c | 1085 | 0.5% |
| 7b7dbf17-a2ad-54cc-ac66-20995d7f6fba | 745 | 0.4% |
| 0cc44ad0-9545-549a-a4ff-90dfc3dd04f6 | 556 | 0.3% |
| b293c847-1724-595c-abe7-6a7fdd0f6fa2 | 473 | 0.2% |
| bcf61a51-73ad-53d9-893b-06a8587b052b | 390 | 0.2% |
| 8e53f01f-e59c-5511-a8fc-63186faaa69e | 388 | 0.2% |
| b53ca40e-954c-5aed-8e1c-27533975d94a | 385 | 0.2% |
| e5c30cf5-d1ea-59e8-a604-a29af2d2bd51 | 357 | 0.2% |
| 006ad83b-cff8-58b0-bf20-ff8c0fb399c8 | 327 | 0.2% |
| Other values (79867) | 192336 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 802388 | 11.1% |
| 5 | 570115 | 7.9% |
| b | 440696 | 6.1% |
| 8 | 429305 | 5.9% |
| 9 | 425090 | 5.9% |
| a | 417970 | 5.8% |
| 0 | 391213 | 5.4% |
| f | 388363 | 5.4% |
| d | 386474 | 5.4% |
| c | 381112 | 5.3% |
| Other values (7) | 2588766 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7221492 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| - | 802388 | 11.1% |
| 5 | 570115 | 7.9% |
| b | 440696 | 6.1% |
| 8 | 429305 | 5.9% |
| 9 | 425090 | 5.9% |
| a | 417970 | 5.8% |
| 0 | 391213 | 5.4% |
| f | 388363 | 5.4% |
| d | 386474 | 5.4% |
| c | 381112 | 5.3% |
| Other values (7) | 2588766 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7221492 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| - | 802388 | 11.1% |
| 5 | 570115 | 7.9% |
| b | 440696 | 6.1% |
| 8 | 429305 | 5.9% |
| 9 | 425090 | 5.9% |
| a | 417970 | 5.8% |
| 0 | 391213 | 5.4% |
| f | 388363 | 5.4% |
| d | 386474 | 5.4% |
| c | 381112 | 5.3% |
| Other values (7) | 2588766 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7221492 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| - | 802388 | 11.1% |
| 5 | 570115 | 7.9% |
| b | 440696 | 6.1% |
| 8 | 429305 | 5.9% |
| 9 | 425090 | 5.9% |
| a | 417970 | 5.8% |
| 0 | 391213 | 5.4% |
| f | 388363 | 5.4% |
| d | 386474 | 5.4% |
| c | 381112 | 5.3% |
| Other values (7) | 2588766 |
relationships.company2.data.type
Categorical
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 412313 |
| Missing (%) | 67.3% |
| Memory size | 38.9 MiB |
| company |
|---|
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | company |
|---|---|
| 2nd row | company |
| 3rd row | company |
| 4th row | company |
| 5th row | company |
Common Values
| Value | Count | Frequency (%) |
| company | 200597 | |
| (Missing) | 412313 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| company | 200597 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 200597 | |
| o | 200597 | |
| m | 200597 | |
| p | 200597 | |
| a | 200597 | |
| n | 200597 | |
| y | 200597 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1404179 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| c | 200597 | |
| o | 200597 | |
| m | 200597 | |
| p | 200597 | |
| a | 200597 | |
| n | 200597 | |
| y | 200597 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1404179 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| c | 200597 | |
| o | 200597 | |
| m | 200597 | |
| p | 200597 | |
| a | 200597 | |
| n | 200597 | |
| y | 200597 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1404179 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| c | 200597 | |
| o | 200597 | |
| m | 200597 | |
| p | 200597 | |
| a | 200597 | |
| n | 200597 | |
| y | 200597 |
domain
Text
Missing 
| Distinct | 93622 |
|---|---|
| Distinct (%) | 15.7% |
| Missing | 14768 |
| Missing (%) | 2.4% |
| Memory size | 44.9 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 38 |
| Mean length | 12.805911 |
| Min length | 4 |
Unique
| Unique | 63872 ? |
|---|---|
| Unique (%) | 10.7% |
Sample
| 1st row | unipart.com |
|---|---|
| 2nd row | oosinternational.com |
| 3rd row | nwncarousel.com |
| 4th row | grape.solutions |
| 5th row | nwncarousel.com |
| Value | Count | Frequency (%) |
| amazon.com | 13888 | 2.3% |
| apple.com | 9156 | 1.5% |
| asus.com | 3590 | 0.6% |
| hyundaiusa.com | 2462 | 0.4% |
| marvel.com | 2441 | 0.4% |
| binance.com | 2310 | 0.4% |
| citigroup.com | 2155 | 0.4% |
| abb.com | 1709 | 0.3% |
| lg.com | 1577 | 0.3% |
| marriott.com | 1575 | 0.3% |
| Other values (93611) | 557279 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 887505 | |
| c | 719553 | 9.4% |
| . | 686221 | 9.0% |
| m | 616923 | 8.1% |
| a | 572742 | 7.5% |
| e | 524858 | 6.9% |
| r | 388423 | 5.1% |
| n | 376760 | 4.9% |
| i | 373381 | 4.9% |
| s | 336504 | 4.4% |
| Other values (30) | 2176883 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7659753 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 887505 | |
| c | 719553 | 9.4% |
| . | 686221 | 9.0% |
| m | 616923 | 8.1% |
| a | 572742 | 7.5% |
| e | 524858 | 6.9% |
| r | 388423 | 5.1% |
| n | 376760 | 4.9% |
| i | 373381 | 4.9% |
| s | 336504 | 4.4% |
| Other values (30) | 2176883 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7659753 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 887505 | |
| c | 719553 | 9.4% |
| . | 686221 | 9.0% |
| m | 616923 | 8.1% |
| a | 572742 | 7.5% |
| e | 524858 | 6.9% |
| r | 388423 | 5.1% |
| n | 376760 | 4.9% |
| i | 373381 | 4.9% |
| s | 336504 | 4.4% |
| Other values (30) | 2176883 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7659753 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 887505 | |
| c | 719553 | 9.4% |
| . | 686221 | 9.0% |
| m | 616923 | 8.1% |
| a | 572742 | 7.5% |
| e | 524858 | 6.9% |
| r | 388423 | 5.1% |
| n | 376760 | 4.9% |
| i | 373381 | 4.9% |
| s | 336504 | 4.4% |
| Other values (30) | 2176883 |
company_name
Text
Missing 
| Distinct | 91492 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 14875 |
| Missing (%) | 2.4% |
| Memory size | 46.9 MiB |
Length
| Max length | 199 |
|---|---|
| Median length | 76 |
| Mean length | 15.670797 |
| Min length | 2 |
Unique
| Unique | 61651 ? |
|---|---|
| Unique (%) | 10.3% |
Sample
| 1st row | Unipart Manufacturing Group |
|---|---|
| 2nd row | OOS International |
| 3rd row | NWN Corporation |
| 4th row | Grape Solutions Plc. |
| 5th row | NWN Corporation |
| Value | Count | Frequency (%) |
| inc | 82173 | 6.1% |
| group | 47046 | 3.5% |
| ltd | 30639 | 2.3% |
| limited | 23107 | 1.7% |
| university | 19803 | 1.5% |
| of | 19753 | 1.5% |
| amazon.com | 13889 | 1.0% |
| llc | 13700 | 1.0% |
| international | 12793 | 1.0% |
| pty | 12053 | 0.9% |
| Other values (67365) | 1066344 |
Most occurring characters
| Value | Count | Frequency (%) |
| 743265 | 7.9% | |
| e | 698779 | 7.5% |
| n | 634879 | 6.8% |
| o | 621110 | 6.6% |
| a | 606001 | 6.5% |
| i | 575233 | 6.1% |
| r | 542288 | 5.8% |
| t | 533619 | 5.7% |
| s | 375059 | 4.0% |
| l | 327650 | 3.5% |
| Other values (224) | 3713802 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9371685 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 743265 | 7.9% | |
| e | 698779 | 7.5% |
| n | 634879 | 6.8% |
| o | 621110 | 6.6% |
| a | 606001 | 6.5% |
| i | 575233 | 6.1% |
| r | 542288 | 5.8% |
| t | 533619 | 5.7% |
| s | 375059 | 4.0% |
| l | 327650 | 3.5% |
| Other values (224) | 3713802 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9371685 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 743265 | 7.9% | |
| e | 698779 | 7.5% |
| n | 634879 | 6.8% |
| o | 621110 | 6.6% |
| a | 606001 | 6.5% |
| i | 575233 | 6.1% |
| r | 542288 | 5.8% |
| t | 533619 | 5.7% |
| s | 375059 | 4.0% |
| l | 327650 | 3.5% |
| Other values (224) | 3713802 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9371685 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 743265 | 7.9% | |
| e | 698779 | 7.5% |
| n | 634879 | 6.8% |
| o | 621110 | 6.6% |
| a | 606001 | 6.5% |
| i | 575233 | 6.1% |
| r | 542288 | 5.8% |
| t | 533619 | 5.7% |
| s | 375059 | 4.0% |
| l | 327650 | 3.5% |
| Other values (224) | 3713802 |
ticker
Text
Missing 
| Distinct | 3491 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 479215 |
| Missing (%) | 78.2% |
| Memory size | 27.7 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 16 |
| Mean length | 9.0011968 |
| Min length | 2 |
Unique
| Unique | 1282 ? |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | ASX:VRL |
|---|---|
| 2nd row | LON:INCH |
| 3rd row | ASX:CNI |
| 4th row | ASX:BEM |
| 5th row | ASX:BEM |
| Value | Count | Frequency (%) |
| nasdaq:amzn | 13888 | 10.4% |
| nsdq:aapl | 9156 | 6.8% |
| otcpk:hymlf | 2462 | 1.8% |
| nyse:c | 2155 | 1.6% |
| swx:abbn | 1709 | 1.3% |
| nasdaq:mar | 1575 | 1.2% |
| nyse:jll | 1439 | 1.1% |
| nasdaq:eei | 1329 | 1.0% |
| otc:fxcof | 1237 | 0.9% |
| nasdaq:eric | 1213 | 0.9% |
| Other values (3483) | 97611 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 154902 | |
| N | 142212 | |
| S | 140332 | |
| : | 133190 | |
| E | 74504 | 6.2% |
| D | 65906 | 5.5% |
| Y | 57720 | 4.8% |
| Q | 55001 | 4.6% |
| T | 37377 | 3.1% |
| M | 35750 | 3.0% |
| Other values (33) | 306521 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1203415 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 154902 | |
| N | 142212 | |
| S | 140332 | |
| : | 133190 | |
| E | 74504 | 6.2% |
| D | 65906 | 5.5% |
| Y | 57720 | 4.8% |
| Q | 55001 | 4.6% |
| T | 37377 | 3.1% |
| M | 35750 | 3.0% |
| Other values (33) | 306521 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1203415 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 154902 | |
| N | 142212 | |
| S | 140332 | |
| : | 133190 | |
| E | 74504 | 6.2% |
| D | 65906 | 5.5% |
| Y | 57720 | 4.8% |
| Q | 55001 | 4.6% |
| T | 37377 | 3.1% |
| M | 35750 | 3.0% |
| Other values (33) | 306521 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1203415 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 154902 | |
| N | 142212 | |
| S | 140332 | |
| : | 133190 | |
| E | 74504 | 6.2% |
| D | 65906 | 5.5% |
| Y | 57720 | 4.8% |
| Q | 55001 | 4.6% |
| T | 37377 | 3.1% |
| M | 35750 | 3.0% |
| Other values (33) | 306521 |
company
Text
| Distinct | 91357 |
|---|---|
| Distinct (%) | 14.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 46.7 MiB |
Length
| Max length | 194 |
|---|---|
| Median length | 77 |
| Mean length | 14.911197 |
| Min length | 0 |
Unique
| Unique | 61506 ? |
|---|---|
| Unique (%) | 10.0% |
Sample
| 1st row | Unipart Manufacturing Group |
|---|---|
| 2nd row | OOS International |
| 3rd row | NWN Corporation |
| 4th row | Grape Solutions Plc |
| 5th row | NWN Corporation |
| Value | Count | Frequency (%) |
| inc | 82173 | 6.2% |
| group | 47046 | 3.5% |
| ltd | 30639 | 2.3% |
| limited | 23107 | 1.7% |
| university | 19920 | 1.5% |
| of | 19753 | 1.5% |
| llc | 13993 | 1.1% |
| amazoncom | 13889 | 1.0% |
| international | 12793 | 1.0% |
| pty | 12053 | 0.9% |
| Other values (66765) | 1056206 |
Most occurring characters
| Value | Count | Frequency (%) |
| 733537 | 8.0% | |
| e | 698779 | 7.6% |
| n | 634879 | 6.9% |
| o | 621110 | 6.8% |
| a | 606001 | 6.6% |
| i | 575233 | 6.3% |
| r | 542288 | 5.9% |
| t | 533619 | 5.8% |
| s | 375059 | 4.1% |
| l | 327650 | 3.6% |
| Other values (53) | 3491067 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9139222 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 733537 | 8.0% | |
| e | 698779 | 7.6% |
| n | 634879 | 6.9% |
| o | 621110 | 6.8% |
| a | 606001 | 6.6% |
| i | 575233 | 6.3% |
| r | 542288 | 5.9% |
| t | 533619 | 5.8% |
| s | 375059 | 4.1% |
| l | 327650 | 3.6% |
| Other values (53) | 3491067 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9139222 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 733537 | 8.0% | |
| e | 698779 | 7.6% |
| n | 634879 | 6.9% |
| o | 621110 | 6.8% |
| a | 606001 | 6.6% |
| i | 575233 | 6.3% |
| r | 542288 | 5.9% |
| t | 533619 | 5.8% |
| s | 375059 | 4.1% |
| l | 327650 | 3.6% |
| Other values (53) | 3491067 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9139222 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 733537 | 8.0% | |
| e | 698779 | 7.6% |
| n | 634879 | 6.9% |
| o | 621110 | 6.8% |
| a | 606001 | 6.6% |
| i | 575233 | 6.3% |
| r | 542288 | 5.9% |
| t | 533619 | 5.8% |
| s | 375059 | 4.1% |
| l | 327650 | 3.6% |
| Other values (53) | 3491067 |
locations
Text
| Distinct | 13558 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.8 MiB |
Length
| Max length | 85 |
|---|---|
| Median length | 0 |
| Mean length | 5.0555775 |
| Min length | 0 |
Unique
| Unique | 8139 ? |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | United Kingdom |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | Hungary |
| 5th row |
| Value | Count | Frequency (%) |
| united | 83681 | 18.9% |
| states | 67541 | 15.3% |
| kingdom | 14321 | 3.2% |
| new | 13826 | 3.1% |
| australia | 12583 | 2.8% |
| india | 10237 | 2.3% |
| california | 7952 | 1.8% |
| york | 7531 | 1.7% |
| texas | 4959 | 1.1% |
| canada | 4610 | 1.0% |
| Other values (10327) | 214936 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 307773 | 9.9% |
| t | 291749 | 9.4% |
| e | 277950 | 9.0% |
| 274014 | 8.8% | |
| i | 252300 | 8.1% |
| n | 243016 | 7.8% |
| s | 155516 | 5.0% |
| d | 150219 | 4.8% |
| o | 124331 | 4.0% |
| r | 114479 | 3.7% |
| Other values (53) | 907267 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3098614 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 307773 | 9.9% |
| t | 291749 | 9.4% |
| e | 277950 | 9.0% |
| 274014 | 8.8% | |
| i | 252300 | 8.1% |
| n | 243016 | 7.8% |
| s | 155516 | 5.0% |
| d | 150219 | 4.8% |
| o | 124331 | 4.0% |
| r | 114479 | 3.7% |
| Other values (53) | 907267 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3098614 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 307773 | 9.9% |
| t | 291749 | 9.4% |
| e | 277950 | 9.0% |
| 274014 | 8.8% | |
| i | 252300 | 8.1% |
| n | 243016 | 7.8% |
| s | 155516 | 5.0% |
| d | 150219 | 4.8% |
| o | 124331 | 4.0% |
| r | 114479 | 3.7% |
| Other values (53) | 907267 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3098614 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 307773 | 9.9% |
| t | 291749 | 9.4% |
| e | 277950 | 9.0% |
| 274014 | 8.8% | |
| i | 252300 | 8.1% |
| n | 243016 | 7.8% |
| s | 155516 | 5.0% |
| d | 150219 | 4.8% |
| o | 124331 | 4.0% |
| r | 114479 | 3.7% |
| Other values (53) | 907267 |
Interactions
Correlations
| confidence | amount_normalized | headcount | |
|---|---|---|---|
| confidence | 1.000 | 0.006 | -0.001 |
| amount_normalized | 0.006 | 1.000 | -0.000 |
| headcount | -0.001 | -0.000 | 1.000 |
| amount_normalized | category | confidence | financing_type_normalized | financing_type_tags | headcount | human_approved | planning | product_data.fuzzy_match | |
|---|---|---|---|---|---|---|---|---|---|
| amount_normalized | 1.000 | 0.000 | -0.036 | 1.000 | 0.000 | -0.002 | 0.011 | 0.000 | 1.000 |
| category | 0.000 | 1.000 | 0.108 | 0.000 | 0.196 | 0.059 | 0.389 | 0.127 | 0.018 |
| confidence | -0.036 | 0.108 | 1.000 | 0.051 | 0.016 | 0.002 | 0.037 | 0.042 | 0.009 |
| financing_type_normalized | 1.000 | 0.000 | 0.051 | 1.000 | 0.446 | 1.000 | 0.047 | 0.298 | 1.000 |
| financing_type_tags | 0.000 | 0.196 | 0.016 | 0.446 | 1.000 | 0.000 | 0.192 | 0.119 | 0.025 |
| headcount | -0.002 | 0.059 | 0.002 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 |
| human_approved | 0.011 | 0.389 | 0.037 | 0.047 | 0.192 | 0.000 | 1.000 | 0.090 | 0.011 |
| planning | 0.000 | 0.127 | 0.042 | 0.298 | 0.119 | 0.000 | 0.090 | 1.000 | 0.002 |
| product_data.fuzzy_match | 1.000 | 0.018 | 0.009 | 1.000 | 0.025 | 1.000 | 0.011 | 0.002 | 1.000 |
Missing values
Sample
| Primary_ID | summary | category | found_at | confidence | article_sentence | human_approved | planning | amount | amount_normalized | assets | assets_tags | award | contact | effective_date | event | financing_type | financing_type_normalized | financing_type_tags | headcount | job_title | job_title_tags | location | product | product_data.full_text | product_data.name | product_data.release_type | product_data.release_version | product_data.fuzzy_match | product_tags | recognition | vulnerability | relationships.company1.data.id | relationships.company1.data.type | relationships.most_relevant_source.data.id | relationships.company2.data.id | relationships.company2.data.type | domain | company_name | ticker | company | locations | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0020f127-3470-4cce-8989-1c79f45da217 | Unipart Manufacturing Group recognized as Transport and Storage sector winner. | recognized_as | 2022-07-10T20:00:00Z | 0.8759 | In addition to being named the safest organisation in the UK, Unipart Logistics won the British Safety Council Chief Adjudicator Award for achieving the highest-scoring application of the 647 received from around the world, and was named Transport and Storage sector winner. | False | False | None | NaN | None | None | None | 1970-01-01 | None | None | None | 0 | None | United Kingdom | None | None | None | None | None | None | Transport and Storage sector winner | None | 000bd323-1bf8-5c7a-9941-e6c155c29d10 | company | d172abc1-3755-4cef-946e-7de944806e7d | NaN | NaN | unipart.com | Unipart Manufacturing Group | None | Unipart Manufacturing Group | United Kingdom | ||||
| 1 | 009be1ff-6cfb-4e9f-a415-69baf71f47f3 | OOS International received award two safety awards on Jan 1st '18. | receives_award | 2019-12-19T10:45:17Z | 0.9497 | Since then OOS International has been an active member of the IADC and received two safety awards in 2018. | False | False | None | NaN | None | two safety awards | None | 2018-01-01 | None | None | None | 0 | None | None | None | None | None | None | None | None | None | None | 000ff896-4292-5b15-9c81-8bf4d76c10d7 | company | 58c0d5fd-068d-4bab-8ac4-47e19bbdf091 | NaN | NaN | oosinternational.com | OOS International | None | OOS International | |||||
| 2 | 01444124-7375-4f03-8879-eb8200b31504 | NWN Corporation received award Global Winner for 2022 Microsoft Meetings, Calling & Devices for Microsoft Teams Partner of the Year Award on Jun 28th '22. | receives_award | 2022-07-12T20:00:00Z | 0.6887 | As a result, with nearly 400 nominees from over 100 countries, NWN Corporation is pleased to announce NWN Carousel was recognized as a Global Winner for 2022 Microsoft Meetings, Calling & Devices for Microsoft Teams Partner of the Year Award. | False | False | None | NaN | None | Global Winner for 2022 Microsoft Meetings, Calling & Devices for Microsoft Teams Partner of the Year Award | None | 2022-06-28 | None | None | None | 0 | None | None | None | None | None | None | None | None | None | None | 000d8a9c-882c-57f2-8b4c-2afc786d0fa1 | company | ef330a38-8624-41c1-8b75-d1b96e7dbd45 | NaN | NaN | nwncarousel.com | NWN Corporation | None | NWN Corporation | |||||
| 3 | 031a304c-29ca-415e-a815-e9c915896540 | Grape Solutions Plc. is developing Mobiliti app on Jan 1st '18. | is_developing | 2023-04-02T22:00:00Z | 0.5987 | MVM Mobiliti and Grape Solutions have been working together since 2018 to develop the Mobiliti app, becoming the most downloaded electric car charging app in Hungary, with more than 215,000 charging stations in 39 countries. | False | False | None | NaN | None | None | None | 2018-01-01 | None | None | None | 0 | None | Hungary | Mobiliti app | Mobiliti app | None | None | None | True | mobile, online_technology | None | None | 0008b75f-9d15-54ae-b70a-52301945e397 | company | 0525807d-6ff6-44a0-9c36-8be3afceba5b | NaN | NaN | grape.solutions | Grape Solutions Plc. | None | Grape Solutions Plc | Hungary | |||
| 4 | 037783ca-f3f7-4782-8a81-df3cae1ac936 | NWN Corporation launched two new kits, At-Home Essentials and Office Collaboration Room-as-a-Service on Apr 13th '22. | launches | 2022-04-13T01:02:36Z | 0.7180 | NWN Carousel, the leading integrated cloud communications service provider, today announced two new kits, At-Home Essentials and Office Collaboration Room-as-a-Service, for organizations to manage the accelerating demands of the hybrid workplace with connectivity, security, devices and visual collaboration. | False | False | None | NaN | None | office | None | None | 2022-04-13 | None | None | None | 0 | None | support | None | two new kits, At-Home Essentials and Office Collaboration Room-as-a-Service | two new kits, At-Home Essentials and Office Collaboration Room-as-a-Service | None | None | None | True | None | None | 000d8a9c-882c-57f2-8b4c-2afc786d0fa1 | company | 16061c55-111d-496a-9e3e-837dddc3454b | NaN | NaN | nwncarousel.com | NWN Corporation | None | NWN Corporation | |||
| 5 | 03d14654-015f-4efa-b986-05a6b032e8ea | Gems Sensors, Inc. launches Model 3800. | launches | 2015-03-24T23:00:00Z | 0.5464 | Gems Sensors & Controls announces the global market launch of its Model 3800 and 3820 Series of reliable, accurate, compact OEM pressure transmitters and switches for hazardous area and other hostile environments. | False | False | None | NaN | None | None | None | 1970-01-01 | None | None | None | 0 | None | None | Model 3800 | Model 3800 | None | None | None | True | None | None | 000875e7-0d87-5fe3-9a42-f80baa9f6f01 | company | 52b18c2e-9c26-4280-b3a4-0035340e2de2 | NaN | NaN | gemssensors.com | Gems Sensors, Inc. | None | Gems Sensors Inc | |||||
| 6 | 04143a02-d0a8-4079-97f1-35bc1497bfb9 | Neuroblastoma Australia Incorporated receives financing of $155K in donations. | receives_financing | 2020-09-08T00:25:02Z | 0.9115 | So far Neuroblastoma Australia has received $155,000 in donations which is just incredible. | False | False | $155,000 | 155000.0 | None | None | None | 1970-01-01 | None | donations | None | donation | 0 | None | None | None | None | None | None | None | None | None | None | 0007bc22-874d-5770-afbe-919728e9c3a2 | company | 3810a5f8-f64e-4987-9cc9-bb58c3e2bb13 | NaN | NaN | neuroblastoma.org.au | Neuroblastoma Australia Incorporated | None | Neuroblastoma Australia Incorporated | ||||
| 7 | 0493a8e0-6cb2-4a0c-9cff-9076252a963d | NWN Corporation recognized as Best Place to Work on Jan 1st '22. | recognized_as | 2023-05-01T18:00:00Z | 0.9673 | He’s also leading a company that makes its own employees happy: NWN Carousel was recognized by Comparably as a “Best Place to Work” in 2022.” | False | False | None | NaN | None | None | None | 2022-01-01 | None | None | None | 0 | None | None | None | None | None | None | None | None | Best Place to Work | None | 000d8a9c-882c-57f2-8b4c-2afc786d0fa1 | company | fcc901f9-d9ec-43c4-af27-b7cdca10acfa | NaN | NaN | nwncarousel.com | NWN Corporation | None | NWN Corporation | |||||
| 8 | 0583b4eb-d105-492b-98f9-a4c419b3f5c7 | NWN Corporation hires Jim Sullivan as CEO and chairman. | hires | 2019-05-09T14:04:00Z | 0.4973 | Solution provider NWN Corp. has appointed a new CEO, Jim Sullivan, who had been serving as president of data management software company Actifio. | False | False | None | NaN | None | None | Jim Sullivan | 1970-01-01 | None | None | None | 0 | CEO and chairman | directors | None | None | None | None | None | None | None | None | None | 000d8a9c-882c-57f2-8b4c-2afc786d0fa1 | company | adbaedb0-b528-415b-8e4f-d038ebbbabd5 | NaN | NaN | nwncarousel.com | NWN Corporation | None | NWN Corporation | ||||
| 9 | 0627a827-df60-4ace-9b79-bc98c9bf4c59 | Neuromersiv Pty Ltd receives financing of $1M in grant funding. | receives_financing | 2016-09-07T20:00:00Z | 1.0000 | Neuromersiv secures $1 million to advance MedTech for stroke, spinal cord & brain injury survivors. | False | False | $1m | 1000000.0 | None | None | None | 1970-01-01 | None | grant funding | None | equity, grant | 0 | None | Sydney, Australia | None | None | None | None | None | None | None | None | 0006cb4b-9b5e-57b1-b30b-f8deb2a91e47 | company | 4efd7b5a-93b8-4434-8a5e-bb0197a55b4c | NaN | NaN | neuromersiv.com | Neuromersiv Pty Ltd | None | Neuromersiv Pty Ltd | Sydney Australia |
| Primary_ID | summary | category | found_at | confidence | article_sentence | human_approved | planning | amount | amount_normalized | assets | assets_tags | award | contact | effective_date | event | financing_type | financing_type_normalized | financing_type_tags | headcount | job_title | job_title_tags | location | product | product_data.full_text | product_data.name | product_data.release_type | product_data.release_version | product_data.fuzzy_match | product_tags | recognition | vulnerability | relationships.company1.data.id | relationships.company1.data.type | relationships.most_relevant_source.data.id | relationships.company2.data.id | relationships.company2.data.type | domain | company_name | ticker | company | locations | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 620775 | 9cb896d4-b0b0-41bc-ab1b-d30528432f45 | HBC launches SOREL Footwear pop-up shops. | launches | 2023-09-23T22:00:00Z | 0.0000 | Hudson's Bay is announcing the arrival of exclusive SOREL Footwear pop-up shops at its Hudson's Bay location at Yorkdale Shopping Centre in Toronto as well as its flagship locations in Vancouver and Montreal. | False | False | None | NaN | None | retail | None | None | 1970-01-01 | None | None | None | 0 | None | None | SOREL Footwear pop-up shops | SOREL Footwear pop-up shops | None | None | None | True | None | None | f7120c57-9de9-56ae-8c6b-d770d0a97461 | company | 73e43540-1859-421d-872f-61c5eb207bc1 | NaN | NaN | hbc.com | HBC | None | HBC | ||||
| 620776 | 9d2580bc-9f25-4349-8b52-f8ac9e8452c6 | Access Industries Inc. invested into Ada Health $47M on Oct 31st '17. | invests_into | 2017-10-30T19:00:22Z | 0.7296 | Today, Ada Health announced that it has raised $47 million in venture capital in a funding round led by Access Industries, June Fund, and Berlin-based Cumberland VC. | True | False | $47 million | 4.700000e+07 | None | None | None | 2017-10-31 | None | None | None | 0 | None | London, United Kingdom | None | None | None | None | None | None | None | None | f70b5d8f-24e5-5c3b-84dc-5596d32dccc7 | company | a04b69fa-0380-41d8-9369-693326f56d09 | 60f68693-a447-59a0-8cec-6de8ffae3036 | company | accessindustries.com | Access Industries Inc. | None | Access Industries Inc | London United Kingdom | ||||
| 620777 | 9d3d5f7b-bbec-434f-aab3-770bcb34cf79 | Maxeon Solar Technologies Ltd. launches Sustainability Report. | launches | 2022-06-30T05:00:00Z | 0.5786 | Maxeon Solar Technologies, Ltd. (NASDAQ: MAXN ), a global leader in solar innovation and channels, today announced the release of its Sustainability Report for the year 2021 (Sustainability Report). | False | False | None | NaN | None | None | None | 1970-01-01 | None | None | None | 0 | None | SINGAPORE | Sustainability Report | Sustainability Report | None | None | None | True | report | None | None | f709c720-e19b-51f3-aa9d-86fdb73bd9b9 | company | c63b5d1d-894f-4bc4-a96a-5b3a0be8ea4e | NaN | NaN | maxeon.com | Maxeon Solar Technologies Ltd. | NASDAQ:MAXN | Maxeon Solar Technologies Ltd | SINGAPORE | |||
| 620778 | 9e42289e-6a00-4017-aff9-c192f13e6ba2 | Access Industries Inc. acquired Warner for $3.3B on Jan 1st '11. | acquires | 2020-02-06T04:00:00Z | 0.6556 | Access Industries acquired Warner Music Group for US$3.3bn in 2011. | False | False | $3.3 billion | 3.300000e+09 | None | None | None | 2011-01-01 | None | None | None | 0 | None | None | None | None | None | None | None | None | None | None | f70b5d8f-24e5-5c3b-84dc-5596d32dccc7 | company | c9d64c5a-7a0f-4612-9a85-0c7bebe183e8 | e44a98ca-f7e3-58d9-9ea9-b0d26610cefb | company | accessindustries.com | Access Industries Inc. | None | Access Industries Inc | |||||
| 620779 | 9e5c037a-0bb5-4aa4-bcd1-a022120872c1 | Mineral Fusion partners with Dress for Success. | partners_with | 2023-06-23T16:34:15Z | 0.4978 | Mineral Fusion is a pledge partner with Dress for Success in the Your Hour, Her Power campaign. | False | False | None | NaN | None | None | None | 1970-01-01 | None | None | None | 0 | None | None | None | None | None | None | None | None | None | None | ed44a6e9-a97f-50e2-8f0b-f94d90eb0cc3 | company | c9cede29-f766-4278-9b95-48fc678c7e08 | f712ba14-7ae6-51c6-9ce9-6d9b7f1947d6 | company | mineralfusion.com | Mineral Fusion | None | Mineral Fusion | |||||
| 620780 | 9eb89308-1185-4673-bd0b-71b49793e597 | Cityofhobart partners with Dress for Success. | partners_with | 2024-03-03T23:00:00Z | 0.3128 | As International Women's Day (IWD) draws near, the City of Hobart reaffirms its commitment to gender equality and female empowerment through its ongoing partnership with Dress for Success, an organization dedicated to assisting women in their employment journey. | False | False | None | NaN | None | None | None | 1970-01-01 | None | None | None | 0 | None | None | None | None | None | None | None | None | None | None | a7b8ea84-323f-5841-be92-4549ae87b975 | company | 5a6492cd-4940-406b-b01c-9c4b626bd492 | f712ba14-7ae6-51c6-9ce9-6d9b7f1947d6 | company | cityofhobart.org | Cityofhobart | None | Cityofhobart | |||||
| 620781 | 9fb29c5d-ddfd-4eb1-9827-e544fa1e4591 | Maxeon Solar Technologies Ltd. has issues with securities fraud or other unlawful business practices. | has_issues_with | 2024-07-18T18:20:17Z | 0.5880 | The class action concerns whether Maxeon and certain of its officers and/or directors have engaged in securities fraud or other unlawful business practices. | False | False | None | NaN | None | None | None | 1970-01-01 | None | None | None | 0 | None | None | None | None | None | None | None | None | None | securities fraud or other unlawful business practices | f709c720-e19b-51f3-aa9d-86fdb73bd9b9 | company | 5eaf97cc-65cf-4f8d-a2c0-7d7951bee748 | NaN | NaN | maxeon.com | Maxeon Solar Technologies Ltd. | NASDAQ:MAXN | Maxeon Solar Technologies Ltd | |||||
| 620782 | a0cc659c-f18a-4841-bd56-412bf79e3bb3 | HBC launched update to Companies Creditors Arrangement Act on Mar 21st '25. | launches | 2025-03-25T00:00:00Z | 0.0000 | On Friday, the Hudson's Bay Company announced an update to its Companies' Creditors Arrangement Act (CCAA), which it filed earlier this month. | False | False | None | NaN | None | None | None | 2025-03-21 | None | None | None | 0 | None | None | update to Companies Creditors Arrangement Act | update to its Companies' Creditors Arrangement Act | None | update | None | True | None | None | f7120c57-9de9-56ae-8c6b-d770d0a97461 | company | a1b74fea-9eba-4f6b-8e93-d21668e6aad7 | NaN | NaN | hbc.com | HBC | None | HBC | |||||
| 620783 | a3c344d0-0e68-4a99-90c9-ab2c90253e67 | Xpansiv CBL Holding Group receives financing of $25M in conditional debt financing. | receives_financing | 2021-07-27T16:20:25Z | 0.7883 | In conjunction with the convertible note, Xpansiv has also secured a further US$25M of conditional debt financing from a global investment bank to support future M&A transactions. | False | False | US$25M | 2.500000e+07 | None | None | None | 1970-01-01 | None | conditional debt financing | None | debt | 0 | None | None | None | None | None | None | None | None | None | None | f70cd42a-379b-5362-9f58-f91f6e079adc | company | 4bb794cd-48d3-4c90-a5bc-50e4b277f1c8 | NaN | NaN | xpansiv.com | Xpansiv CBL Holding Group | None | Xpansiv CBL Holding Group | ||||
| 620784 | a3d9d83c-47d6-4c5a-b87c-ae3e52484f01 | Mirada PLC launches Iris TV Everywhere Solution at TV Connect. | launches | 2019-09-05T11:29:00Z | 0.7016 | Mirada will announce its newest release of Iris TV Everywhere Solution at TV Connect in London. | False | False | None | NaN | None | None | None | 1970-01-01 | None | None | None | 0 | None | London, United Kingdom | Iris TV Everywhere Solution at TV Connect | Iris TV Everywhere Solution at TV Connect | None | None | None | True | None | None | f7099284-191e-584f-b598-2128718fd969 | company | 3da0d195-3915-4897-8467-a9678f594329 | NaN | NaN | mirada.tv | Mirada PLC | LON:MIRA | Mirada PLC | London United Kingdom |